Adaptive-Attentive Geolocalization From Few Queries: A Hybrid Approach

We tackle the task of cross-domain visual geo-localization, where the goal is to geo-localize a given query image against a database of geo-tagged images, in the case where the query and the database belong to different visual domains. In particular, at training time, we consider having access to on...

Full description

Bibliographic Details
Main Authors: Berton, G. (Author), Caputo, B. (Author), Masone, C. (Author), Montagna, F. (Author), Paolicelli, V. (Author)
Format: Article
Language:English
Published: Frontiers Media S.A. 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 01862nam a2200241Ia 4500
001 10.3389-fcomp.2022.841817
008 220718s2022 CNT 000 0 und d
020 |a 26249898 (ISSN) 
245 1 0 |a Adaptive-Attentive Geolocalization From Few Queries: A Hybrid Approach 
260 0 |b Frontiers Media S.A.  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.3389/fcomp.2022.841817 
520 3 |a We tackle the task of cross-domain visual geo-localization, where the goal is to geo-localize a given query image against a database of geo-tagged images, in the case where the query and the database belong to different visual domains. In particular, at training time, we consider having access to only few unlabeled queries from the target domain. To adapt our deep neural network to the database distribution, we rely on a 2-fold domain adaptation technique, based on a hybrid generative-discriminative approach. To further enhance the architecture, and to ensure robustness across domains, we employ a novel attention layer that can easily be plugged into existing architectures. Through a large number of experiments, we show that this adaptive-attentive approach makes the model robust to large domain shifts, such as unseen cities or weather conditions. Finally, we propose a new large-scale dataset for cross-domain visual geo-localization, called SVOX. Copyright © 2022 Paolicelli, Berton, Montagna, Masone and Caputo. 
650 0 4 |a domain adaptation (DA) 
650 0 4 |a domain generalization 
650 0 4 |a few-shot domain adaptation 
650 0 4 |a visual geolocalization 
650 0 4 |a visual place recognition (VPR) 
700 1 |a Berton, G.  |e author 
700 1 |a Caputo, B.  |e author 
700 1 |a Masone, C.  |e author 
700 1 |a Montagna, F.  |e author 
700 1 |a Paolicelli, V.  |e author 
773 |t Frontiers in Computer Science  |x 26249898 (ISSN)  |g 4