Non-isotropy Regularization for Proxy-based Deep Metric Learning
Deep Metric Learning (DML) aims to learn representation spaces on which semantic relations can simply be expressed through predefined distance metrics. Best performing approaches commonly leverage class proxies as sample stand-ins for better convergence and generalization. However, these proxy-metho...
Main Authors: | , , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
2022
|
Subjects: | |
Online Access: | http://arxiv.org/abs/2203.08547 |
id |
ftarxivpreprints:oai:arXiv.org:2203.08547 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:2203.08547 2023-09-05T13:19:06+02:00 Non-isotropy Regularization for Proxy-based Deep Metric Learning Roth, Karsten Vinyals, Oriol Akata, Zeynep 2022-03-16 http://arxiv.org/abs/2203.08547 unknown http://arxiv.org/abs/2203.08547 Computer Science - Computer Vision and Pattern Recognition text 2022 ftarxivpreprints 2023-08-16T16:58:52Z Deep Metric Learning (DML) aims to learn representation spaces on which semantic relations can simply be expressed through predefined distance metrics. Best performing approaches commonly leverage class proxies as sample stand-ins for better convergence and generalization. However, these proxy-methods solely optimize for sample-proxy distances. Given the inherent non-bijectiveness of used distance functions, this can induce locally isotropic sample distributions, leading to crucial semantic context being missed due to difficulties resolving local structures and intraclass relations between samples. To alleviate this problem, we propose non-isotropy regularization ($\mathbb{NIR}$) for proxy-based Deep Metric Learning. By leveraging Normalizing Flows, we enforce unique translatability of samples from their respective class proxies. This allows us to explicitly induce a non-isotropic distribution of samples around a proxy to optimize for. In doing so, we equip proxy-based objectives to better learn local structures. Extensive experiments highlight consistent generalization benefits of $\mathbb{NIR}$ while achieving competitive and state-of-the-art performance on the standard benchmarks CUB200-2011, Cars196 and Stanford Online Products. In addition, we find the superior convergence properties of proxy-based methods to still be retained or even improved, making $\mathbb{NIR}$ very attractive for practical usage. Code available at https://github.com/ExplainableML/NonIsotropicProxyDML. Comment: Accepted to CVPR 2022 Text DML ArXiv.org (Cornell University Library) |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Computer Science - Computer Vision and Pattern Recognition |
spellingShingle |
Computer Science - Computer Vision and Pattern Recognition Roth, Karsten Vinyals, Oriol Akata, Zeynep Non-isotropy Regularization for Proxy-based Deep Metric Learning |
topic_facet |
Computer Science - Computer Vision and Pattern Recognition |
description |
Deep Metric Learning (DML) aims to learn representation spaces on which semantic relations can simply be expressed through predefined distance metrics. Best performing approaches commonly leverage class proxies as sample stand-ins for better convergence and generalization. However, these proxy-methods solely optimize for sample-proxy distances. Given the inherent non-bijectiveness of used distance functions, this can induce locally isotropic sample distributions, leading to crucial semantic context being missed due to difficulties resolving local structures and intraclass relations between samples. To alleviate this problem, we propose non-isotropy regularization ($\mathbb{NIR}$) for proxy-based Deep Metric Learning. By leveraging Normalizing Flows, we enforce unique translatability of samples from their respective class proxies. This allows us to explicitly induce a non-isotropic distribution of samples around a proxy to optimize for. In doing so, we equip proxy-based objectives to better learn local structures. Extensive experiments highlight consistent generalization benefits of $\mathbb{NIR}$ while achieving competitive and state-of-the-art performance on the standard benchmarks CUB200-2011, Cars196 and Stanford Online Products. In addition, we find the superior convergence properties of proxy-based methods to still be retained or even improved, making $\mathbb{NIR}$ very attractive for practical usage. Code available at https://github.com/ExplainableML/NonIsotropicProxyDML. Comment: Accepted to CVPR 2022 |
format |
Text |
author |
Roth, Karsten Vinyals, Oriol Akata, Zeynep |
author_facet |
Roth, Karsten Vinyals, Oriol Akata, Zeynep |
author_sort |
Roth, Karsten |
title |
Non-isotropy Regularization for Proxy-based Deep Metric Learning |
title_short |
Non-isotropy Regularization for Proxy-based Deep Metric Learning |
title_full |
Non-isotropy Regularization for Proxy-based Deep Metric Learning |
title_fullStr |
Non-isotropy Regularization for Proxy-based Deep Metric Learning |
title_full_unstemmed |
Non-isotropy Regularization for Proxy-based Deep Metric Learning |
title_sort |
non-isotropy regularization for proxy-based deep metric learning |
publishDate |
2022 |
url |
http://arxiv.org/abs/2203.08547 |
genre |
DML |
genre_facet |
DML |
op_relation |
http://arxiv.org/abs/2203.08547 |
_version_ |
1776199925182234624 |