GSV-Cities: Toward Appropriate Supervised Visual Place Recognition
This paper aims to investigate representation learning for large scale visual place recognition, which consists of determining the location depicted in a query image by referring to a database of reference images. This is a challenging task due to the large-scale environmental changes that can occur...
Published in: | Neurocomputing |
---|---|
Main Authors: | , , |
Format: | Text |
Language: | unknown |
Published: |
2022
|
Subjects: | |
Online Access: | http://arxiv.org/abs/2210.10239 https://doi.org/10.1016/j.neucom.2022.09.127 |
id |
ftarxivpreprints:oai:arXiv.org:2210.10239 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:2210.10239 2023-09-05T13:21:15+02:00 GSV-Cities: Toward Appropriate Supervised Visual Place Recognition Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe 2022-10-18 http://arxiv.org/abs/2210.10239 https://doi.org/10.1016/j.neucom.2022.09.127 unknown http://arxiv.org/abs/2210.10239 doi:10.1016/j.neucom.2022.09.127 Computer Science - Computer Vision and Pattern Recognition text 2022 ftarxivpreprints https://doi.org/10.1016/j.neucom.2022.09.127 2023-08-16T17:20:33Z This paper aims to investigate representation learning for large scale visual place recognition, which consists of determining the location depicted in a query image by referring to a database of reference images. This is a challenging task due to the large-scale environmental changes that can occur over time (i.e., weather, illumination, season, traffic, occlusion). Progress is currently challenged by the lack of large databases with accurate ground truth. To address this challenge, we introduce GSV-Cities, a new image dataset providing the widest geographic coverage to date with highly accurate ground truth, covering more than 40 cities across all continents over a 14-year period. We subsequently explore the full potential of recent advances in deep metric learning to train networks specifically for place recognition, and evaluate how different loss functions influence performance. In addition, we show that performance of existing methods substantially improves when trained on GSV-Cities. Finally, we introduce a new fully convolutional aggregation layer that outperforms existing techniques, including GeM, NetVLAD and CosPlace, and establish a new state-of-the-art on large-scale benchmarks, such as Pittsburgh, Mapillary-SLS, SPED and Nordland. The dataset and code are available for research purposes at https://github.com/amaralibey/gsv-cities. Comment: Neurocomputing 2022 Text Nordland Nordland Nordland ArXiv.org (Cornell University Library) Neurocomputing 513 194 203 |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Computer Science - Computer Vision and Pattern Recognition |
spellingShingle |
Computer Science - Computer Vision and Pattern Recognition Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe GSV-Cities: Toward Appropriate Supervised Visual Place Recognition |
topic_facet |
Computer Science - Computer Vision and Pattern Recognition |
description |
This paper aims to investigate representation learning for large scale visual place recognition, which consists of determining the location depicted in a query image by referring to a database of reference images. This is a challenging task due to the large-scale environmental changes that can occur over time (i.e., weather, illumination, season, traffic, occlusion). Progress is currently challenged by the lack of large databases with accurate ground truth. To address this challenge, we introduce GSV-Cities, a new image dataset providing the widest geographic coverage to date with highly accurate ground truth, covering more than 40 cities across all continents over a 14-year period. We subsequently explore the full potential of recent advances in deep metric learning to train networks specifically for place recognition, and evaluate how different loss functions influence performance. In addition, we show that performance of existing methods substantially improves when trained on GSV-Cities. Finally, we introduce a new fully convolutional aggregation layer that outperforms existing techniques, including GeM, NetVLAD and CosPlace, and establish a new state-of-the-art on large-scale benchmarks, such as Pittsburgh, Mapillary-SLS, SPED and Nordland. The dataset and code are available for research purposes at https://github.com/amaralibey/gsv-cities. Comment: Neurocomputing 2022 |
format |
Text |
author |
Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe |
author_facet |
Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe |
author_sort |
Ali-bey, Amar |
title |
GSV-Cities: Toward Appropriate Supervised Visual Place Recognition |
title_short |
GSV-Cities: Toward Appropriate Supervised Visual Place Recognition |
title_full |
GSV-Cities: Toward Appropriate Supervised Visual Place Recognition |
title_fullStr |
GSV-Cities: Toward Appropriate Supervised Visual Place Recognition |
title_full_unstemmed |
GSV-Cities: Toward Appropriate Supervised Visual Place Recognition |
title_sort |
gsv-cities: toward appropriate supervised visual place recognition |
publishDate |
2022 |
url |
http://arxiv.org/abs/2210.10239 https://doi.org/10.1016/j.neucom.2022.09.127 |
genre |
Nordland Nordland Nordland |
genre_facet |
Nordland Nordland Nordland |
op_relation |
http://arxiv.org/abs/2210.10239 doi:10.1016/j.neucom.2022.09.127 |
op_doi |
https://doi.org/10.1016/j.neucom.2022.09.127 |
container_title |
Neurocomputing |
container_volume |
513 |
container_start_page |
194 |
op_container_end_page |
203 |
_version_ |
1776201842814877696 |