Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation

Mule E.; Pro F.; Papa L.; Maiano L.; Amerini I.
2025

Abstract

The recent advancements in generative AI techniques, which have significantly increased the online dissemination of altered images and videos, have raised serious concerns about the credibility of digital media available on the Internet and distributed through information channels and social networks. This issue particularly affects domains that rely heavily on trustworthy data, such as journalism, forensic analysis, and Earth observation. To address these concerns, the ability to geolocate a non-geo-tagged ground-view image without external information, such as GPS coordinates, has become increasingly critical. This study tackles the challenge of linking a ground-view image, potentially exhibiting varying fields of view (FoV), to its corresponding satellite image without the aid of GPS data. To achieve this, we propose a novel four-stream Siamese-like architecture, the Quadruple Semantic Align Net (SAN-QUAD), which extends previous state-of-the-art (SOTA) approaches by leveraging semantic segmentation applied to both ground and satellite imagery. Experimental results on a subset of the CVUSA dataset demonstrate significant improvements of up to 9.8% over prior methods across various FoV settings.
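The matching task described in the abstract can be illustrated with a minimal retrieval sketch. This is not the SAN-QUAD implementation: this record does not describe how the four streams are fused or trained, so the concatenation-based fusion, the function names (`l2_normalize`, `fuse`, `recall_at_k`), and the random toy descriptors below are all assumptions made for illustration only. The sketch shows the general idea of pairing an RGB descriptor with a segmentation descriptor for each view, then ranking satellite candidates by cosine similarity.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def fuse(rgb_feat, seg_feat):
    """Hypothetical fusion: concatenate RGB and segmentation descriptors of one view."""
    return l2_normalize(np.concatenate([rgb_feat, seg_feat], axis=1))

def recall_at_k(ground, satellite, k=1):
    """Fraction of ground queries whose true match (same row index) ranks in the top k."""
    sim = ground @ satellite.T                   # (n_ground, n_satellite) cosine similarities
    topk = np.argsort(-sim, axis=1)[:, :k]       # indices of the k most similar satellite rows
    truth = np.arange(ground.shape[0])[:, None]  # ground row i matches satellite row i
    return float((topk == truth).any(axis=1).mean())

# Toy data standing in for the outputs of four feature extractors (assumed, not real features).
rng = np.random.default_rng(0)
n, d = 8, 16
sat_rgb, sat_seg = rng.normal(size=(n, d)), rng.normal(size=(n, d))
ground_rgb = sat_rgb + 0.01 * rng.normal(size=(n, d))  # noisy copies simulate matching pairs
ground_seg = sat_seg + 0.01 * rng.normal(size=(n, d))

ground = fuse(ground_rgb, ground_seg)
satellite = fuse(sat_rgb, sat_seg)
r1 = recall_at_k(ground, satellite, k=1)
print(f"recall@1 on toy data: {r1:.2f}")
```

In a real system the descriptors would come from learned networks applied to the ground image, the satellite image, and their segmentation masks; the ranking step would stay the same.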
2025
2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2025
Aerial images; AI techniques; Earth observations; External informations; Field of views; Forensic analysis; Information channels; Matchings; Online dissemination; Semantic segmentation
04 Publication in conference proceedings::04b Conference paper in a volume
Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation / Mule, E.; Pannacci, M.; Goudarzi, A. G.; Pro, F.; Papa, L.; Maiano, L.; Amerini, I. - (2025), pp. 747-755. (2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2025, Tucson, USA) [10.1109/WACVW65960.2025.00089].
Files attached to this item
File Size Format
Mule_postprint_Enhancing-Ground_2025.pdf

open access

Note: DOI: 10.1109/WACVW65960.2025.00089
Type: Post-print document (version after peer review, accepted for publication)
License: All rights reserved
Size 1.64 MB
Format Adobe PDF
Mule_Enhancing_Ground-to-Aerial_2025.pdf

archive managers only

Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size 1.58 MB
Format Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1756220