Global Proxy-based Hard Mining for Visual Place Recognition

Learning deep representations for visual place recognition is commonly performed using pairwise or triple loss functions that highly depend on the hardness of the examples sampled at each training iteration. Existing techniques address this by using computationally and memory expensive offline hard...

Full description

Bibliographic Details
Main Authors: Ali-bey, Amar, Chaib-draa, Brahim, Giguère, Philippe
Format: Text
Language:unknown
Published: 2023
Subjects:
Online Access:http://arxiv.org/abs/2302.14217
id ftarxivpreprints:oai:arXiv.org:2302.14217
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:2302.14217 2023-09-05T13:21:15+02:00 Global Proxy-based Hard Mining for Visual Place Recognition Ali-bey, Amar Chaib-draa, Brahim Giguère, Philippe 2023-02-27 http://arxiv.org/abs/2302.14217 unknown http://arxiv.org/abs/2302.14217 Computer Science - Computer Vision and Pattern Recognition text 2023 ftarxivpreprints 2023-08-16T17:33:49Z Learning deep representations for visual place recognition is commonly performed using pairwise or triple loss functions that highly depend on the hardness of the examples sampled at each training iteration. Existing techniques address this by using computationally and memory expensive offline hard mining, which consists of identifying, at each iteration, the hardest samples from the training set. In this paper we introduce a new technique that performs global hard mini-batch sampling based on proxies. To do so, we add a new end-to-end trainable branch to the network, which generates efficient place descriptors (one proxy for each place). These proxy representations are thus used to construct a global index that encompasses the similarities between all places in the dataset, allowing for highly informative mini-batch sampling at each training iteration. Our method can be used in combination with all existing pairwise and triplet loss functions with negligible additional memory and computation cost. We run extensive ablation studies and show that our technique brings new state-of-the-art performance on multiple large-scale benchmarks such as Pittsburgh, Mapillary-SLS and SPED. In particular, our method provides more than 100% relative improvement on the challenging Nordland dataset. Our code is available at https://github.com/amaralibey/GPM Comment: Accepted at BMVC 2022 Text Nordland Nordland Nordland ArXiv.org (Cornell University Library)
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Computer Vision and Pattern Recognition
spellingShingle Computer Science - Computer Vision and Pattern Recognition
Ali-bey, Amar
Chaib-draa, Brahim
Giguère, Philippe
Global Proxy-based Hard Mining for Visual Place Recognition
topic_facet Computer Science - Computer Vision and Pattern Recognition
description Learning deep representations for visual place recognition is commonly performed using pairwise or triple loss functions that highly depend on the hardness of the examples sampled at each training iteration. Existing techniques address this by using computationally and memory expensive offline hard mining, which consists of identifying, at each iteration, the hardest samples from the training set. In this paper we introduce a new technique that performs global hard mini-batch sampling based on proxies. To do so, we add a new end-to-end trainable branch to the network, which generates efficient place descriptors (one proxy for each place). These proxy representations are thus used to construct a global index that encompasses the similarities between all places in the dataset, allowing for highly informative mini-batch sampling at each training iteration. Our method can be used in combination with all existing pairwise and triplet loss functions with negligible additional memory and computation cost. We run extensive ablation studies and show that our technique brings new state-of-the-art performance on multiple large-scale benchmarks such as Pittsburgh, Mapillary-SLS and SPED. In particular, our method provides more than 100% relative improvement on the challenging Nordland dataset. Our code is available at https://github.com/amaralibey/GPM Comment: Accepted at BMVC 2022
format Text
author Ali-bey, Amar
Chaib-draa, Brahim
Giguère, Philippe
author_facet Ali-bey, Amar
Chaib-draa, Brahim
Giguère, Philippe
author_sort Ali-bey, Amar
title Global Proxy-based Hard Mining for Visual Place Recognition
title_short Global Proxy-based Hard Mining for Visual Place Recognition
title_full Global Proxy-based Hard Mining for Visual Place Recognition
title_fullStr Global Proxy-based Hard Mining for Visual Place Recognition
title_full_unstemmed Global Proxy-based Hard Mining for Visual Place Recognition
title_sort global proxy-based hard mining for visual place recognition
publishDate 2023
url http://arxiv.org/abs/2302.14217
genre Nordland
Nordland
Nordland
genre_facet Nordland
Nordland
Nordland
op_relation http://arxiv.org/abs/2302.14217
_version_ 1776201842353504256