GSV-Cities: Toward Appropriate Supervised Visual Place Recognition

This paper aims to investigate representation learning for large scale visual place recognition, which consists of determining the location depicted in a query image by referring to a database of reference images. This is a challenging task due to the large-scale environmental changes that can occur...

Full description

Bibliographic Details
Published in:	Neurocomputing
Main Authors:	Ali-bey, Amar, Chaib-draa, Brahim, Giguère, Philippe
Format:	Text
Language:	unknown
Published:	2022
Subjects:	Computer Science - Computer Vision and Pattern Recognition Nordland
Online Access:	http://arxiv.org/abs/2210.10239 https://doi.org/10.1016/j.neucom.2022.09.127

id	ftarxivpreprints:oai:arXiv.org:2210.10239
record_format	openpolar
spelling	ftarxivpreprints:oai:arXiv.org:2210.10239 2023-09-05T13:21:15+02:00 GSV-Cities: Toward Appropriate Supervised Visual Place Recognition Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe 2022-10-18 http://arxiv.org/abs/2210.10239 https://doi.org/10.1016/j.neucom.2022.09.127 unknown http://arxiv.org/abs/2210.10239 doi:10.1016/j.neucom.2022.09.127 Computer Science - Computer Vision and Pattern Recognition text 2022 ftarxivpreprints https://doi.org/10.1016/j.neucom.2022.09.127 2023-08-16T17:20:33Z This paper aims to investigate representation learning for large scale visual place recognition, which consists of determining the location depicted in a query image by referring to a database of reference images. This is a challenging task due to the large-scale environmental changes that can occur over time (i.e., weather, illumination, season, traffic, occlusion). Progress is currently challenged by the lack of large databases with accurate ground truth. To address this challenge, we introduce GSV-Cities, a new image dataset providing the widest geographic coverage to date with highly accurate ground truth, covering more than 40 cities across all continents over a 14-year period. We subsequently explore the full potential of recent advances in deep metric learning to train networks specifically for place recognition, and evaluate how different loss functions influence performance. In addition, we show that performance of existing methods substantially improves when trained on GSV-Cities. Finally, we introduce a new fully convolutional aggregation layer that outperforms existing techniques, including GeM, NetVLAD and CosPlace, and establish a new state-of-the-art on large-scale benchmarks, such as Pittsburgh, Mapillary-SLS, SPED and Nordland. The dataset and code are available for research purposes at https://github.com/amaralibey/gsv-cities. Comment: Neurocomputing 2022 Text Nordland Nordland Nordland ArXiv.org (Cornell University Library) Neurocomputing 513 194 203
institution	Open Polar
collection	ArXiv.org (Cornell University Library)
op_collection_id	ftarxivpreprints
language	unknown
topic	Computer Science - Computer Vision and Pattern Recognition
spellingShingle	Computer Science - Computer Vision and Pattern Recognition Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
topic_facet	Computer Science - Computer Vision and Pattern Recognition
description	This paper aims to investigate representation learning for large scale visual place recognition, which consists of determining the location depicted in a query image by referring to a database of reference images. This is a challenging task due to the large-scale environmental changes that can occur over time (i.e., weather, illumination, season, traffic, occlusion). Progress is currently challenged by the lack of large databases with accurate ground truth. To address this challenge, we introduce GSV-Cities, a new image dataset providing the widest geographic coverage to date with highly accurate ground truth, covering more than 40 cities across all continents over a 14-year period. We subsequently explore the full potential of recent advances in deep metric learning to train networks specifically for place recognition, and evaluate how different loss functions influence performance. In addition, we show that performance of existing methods substantially improves when trained on GSV-Cities. Finally, we introduce a new fully convolutional aggregation layer that outperforms existing techniques, including GeM, NetVLAD and CosPlace, and establish a new state-of-the-art on large-scale benchmarks, such as Pittsburgh, Mapillary-SLS, SPED and Nordland. The dataset and code are available for research purposes at https://github.com/amaralibey/gsv-cities. Comment: Neurocomputing 2022
format	Text
author	Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe
author_facet	Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe
author_sort	Ali-bey, Amar
title	GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_short	GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_full	GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_fullStr	GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_full_unstemmed	GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
title_sort	gsv-cities: toward appropriate supervised visual place recognition
publishDate	2022
url	http://arxiv.org/abs/2210.10239 https://doi.org/10.1016/j.neucom.2022.09.127
genre	Nordland Nordland Nordland
genre_facet	Nordland Nordland Nordland
op_relation	http://arxiv.org/abs/2210.10239 doi:10.1016/j.neucom.2022.09.127
op_doi	https://doi.org/10.1016/j.neucom.2022.09.127
container_title	Neurocomputing
container_volume	513
container_start_page	194
op_container_end_page	203
_version_	1776201842814877696

GSV-Cities: Toward Appropriate Supervised Visual Place Recognition

Similar Items