GSV-Cities: Toward Appropriate Supervised Visual Place Recognition

This paper investigates representation learning for large-scale visual place recognition, which consists of determining the location depicted in a query image by referring to a database of reference images. This is a challenging task due to the large-scale environmental changes that can occur...

Bibliographic Details
Published in: Neurocomputing
Main Authors: Ali-bey, Amar; Chaib-draa, Brahim; Giguère, Philippe
Format: Text
Language: unknown
Published: 2022
Subjects: Computer Science - Computer Vision and Pattern Recognition
Online Access: http://arxiv.org/abs/2210.10239
https://doi.org/10.1016/j.neucom.2022.09.127
id ftarxivpreprints:oai:arXiv.org:2210.10239
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:2210.10239 2023-09-05T13:21:15+02:00 GSV-Cities: Toward Appropriate Supervised Visual Place Recognition Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe 2022-10-18 http://arxiv.org/abs/2210.10239 https://doi.org/10.1016/j.neucom.2022.09.127 unknown doi:10.1016/j.neucom.2022.09.127 Computer Science - Computer Vision and Pattern Recognition text 2022 ftarxivpreprints 2023-08-16T17:20:33Z Comment: Neurocomputing 2022 Text Nordland ArXiv.org (Cornell University Library) Neurocomputing 513 194 203
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Computer Vision and Pattern Recognition
spellingShingle Computer Science - Computer Vision and Pattern Recognition
Ali-bey, Amar
Chaib-draa, Brahim
Giguère, Philippe
GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
topic_facet Computer Science - Computer Vision and Pattern Recognition
description This paper investigates representation learning for large-scale visual place recognition, which consists of determining the location depicted in a query image by referring to a database of reference images. This is a challenging task due to the large-scale environmental changes that can occur over time (e.g., weather, illumination, season, traffic, occlusion). Progress is currently hindered by the lack of large databases with accurate ground truth. To address this challenge, we introduce GSV-Cities, a new image dataset providing the widest geographic coverage to date with highly accurate ground truth, covering more than 40 cities across all continents over a 14-year period. We subsequently explore the full potential of recent advances in deep metric learning to train networks specifically for place recognition, and evaluate how different loss functions influence performance. In addition, we show that the performance of existing methods improves substantially when they are trained on GSV-Cities. Finally, we introduce a new fully convolutional aggregation layer that outperforms existing techniques, including GeM, NetVLAD and CosPlace, and establish a new state of the art on large-scale benchmarks such as Pittsburgh, Mapillary-SLS, SPED and Nordland. The dataset and code are available for research purposes at https://github.com/amaralibey/gsv-cities. Comment: Neurocomputing 2022
format Text
author Ali-bey, Amar
Chaib-draa, Brahim
Giguère, Philippe
author_facet Ali-bey, Amar
Chaib-draa, Brahim
Giguère, Philippe
author_sort Ali-bey, Amar
title GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_short GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_full GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_fullStr GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_full_unstemmed GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_sort gsv-cities: toward appropriate supervised visual place recognition
publishDate 2022
url http://arxiv.org/abs/2210.10239
https://doi.org/10.1016/j.neucom.2022.09.127
genre Nordland
genre_facet Nordland
op_relation http://arxiv.org/abs/2210.10239
doi:10.1016/j.neucom.2022.09.127
op_doi https://doi.org/10.1016/j.neucom.2022.09.127
container_title Neurocomputing
container_volume 513
container_start_page 194
op_container_end_page 203
_version_ 1776201842814877696