Generalized Sum Pooling for Metric Learning
A common architectural choice for deep metric learning is a convolutional neural network followed by global average pooling (GAP). Albeit simple, GAP is a highly effective way to aggregate information. One possible explanation for the effectiveness of GAP is considering each feature vector as repres...
Main Authors: | , , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
2023
|
Subjects: | |
Online Access: | http://arxiv.org/abs/2308.09228 |
id |
ftarxivpreprints:oai:arXiv.org:2308.09228 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:2308.09228 2023-09-26T15:17:35+02:00 Generalized Sum Pooling for Metric Learning Gurbuz, Yeti Z. Sener, Ozan Alatan, A. Aydın 2023-08-17 http://arxiv.org/abs/2308.09228 unknown http://arxiv.org/abs/2308.09228 Computer Science - Computer Vision and Pattern Recognition Computer Science - Machine Learning Statistics - Machine Learning text 2023 ftarxivpreprints 2023-08-27T01:06:18Z A common architectural choice for deep metric learning is a convolutional neural network followed by global average pooling (GAP). Albeit simple, GAP is a highly effective way to aggregate information. One possible explanation for the effectiveness of GAP is considering each feature vector as representing a different semantic entity and GAP as a convex combination of them. Following this perspective, we generalize GAP and propose a learnable generalized sum pooling method (GSP). GSP improves GAP with two distinct abilities: i) the ability to choose a subset of semantic entities, effectively learning to ignore nuisance information, and ii) learning the weights corresponding to the importance of each entity. Formally, we propose an entropy-smoothed optimal transport problem and show that it is a strict generalization of GAP, i.e., a specific realization of the problem gives back GAP. We show that this optimization problem enjoys analytical gradients enabling us to use it as a direct learnable replacement for GAP. We further propose a zero-shot loss to ease the learning of GSP. We show the effectiveness of our method with extensive evaluations on 4 popular metric learning benchmarks. Code is available at: GSP-DML Framework Comment: Accepted as a conference paper at International Conference on Computer Vision (ICCV) 2023 Text DML ArXiv.org (Cornell University Library) |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Computer Science - Computer Vision and Pattern Recognition Computer Science - Machine Learning Statistics - Machine Learning |
spellingShingle |
Computer Science - Computer Vision and Pattern Recognition Computer Science - Machine Learning Statistics - Machine Learning Gurbuz, Yeti Z. Sener, Ozan Alatan, A. Aydın Generalized Sum Pooling for Metric Learning |
topic_facet |
Computer Science - Computer Vision and Pattern Recognition Computer Science - Machine Learning Statistics - Machine Learning |
description |
A common architectural choice for deep metric learning is a convolutional neural network followed by global average pooling (GAP). Albeit simple, GAP is a highly effective way to aggregate information. One possible explanation for the effectiveness of GAP is considering each feature vector as representing a different semantic entity and GAP as a convex combination of them. Following this perspective, we generalize GAP and propose a learnable generalized sum pooling method (GSP). GSP improves GAP with two distinct abilities: i) the ability to choose a subset of semantic entities, effectively learning to ignore nuisance information, and ii) learning the weights corresponding to the importance of each entity. Formally, we propose an entropy-smoothed optimal transport problem and show that it is a strict generalization of GAP, i.e., a specific realization of the problem gives back GAP. We show that this optimization problem enjoys analytical gradients enabling us to use it as a direct learnable replacement for GAP. We further propose a zero-shot loss to ease the learning of GSP. We show the effectiveness of our method with extensive evaluations on 4 popular metric learning benchmarks. Code is available at: GSP-DML Framework Comment: Accepted as a conference paper at International Conference on Computer Vision (ICCV) 2023 |
format |
Text |
author |
Gurbuz, Yeti Z. Sener, Ozan Alatan, A. Aydın |
author_facet |
Gurbuz, Yeti Z. Sener, Ozan Alatan, A. Aydın |
author_sort |
Gurbuz, Yeti Z. |
title |
Generalized Sum Pooling for Metric Learning |
title_short |
Generalized Sum Pooling for Metric Learning |
title_full |
Generalized Sum Pooling for Metric Learning |
title_fullStr |
Generalized Sum Pooling for Metric Learning |
title_full_unstemmed |
Generalized Sum Pooling for Metric Learning |
title_sort |
generalized sum pooling for metric learning |
publishDate |
2023 |
url |
http://arxiv.org/abs/2308.09228 |
genre |
DML |
genre_facet |
DML |
op_relation |
http://arxiv.org/abs/2308.09228 |
_version_ |
1778139065711853568 |