Deep Metric Learning with Density Adaptivity
The problem of distance metric learning is mostly considered from the perspective of learning an embedding space, where the distances between pairs of examples are in correspondence with a similarity metric. With the rise and success of Convolutional Neural Networks (CNN), deep metric learning (DML)...
Main Authors: | , , , , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
2019
|
Subjects: | |
Online Access: | http://arxiv.org/abs/1909.03909 |
id |
ftarxivpreprints:oai:arXiv.org:1909.03909 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:1909.03909 2023-09-05T13:19:05+02:00 Deep Metric Learning with Density Adaptivity Li, Yehao Yao, Ting Pan, Yingwei Chao, Hongyang Mei, Tao 2019-09-09 http://arxiv.org/abs/1909.03909 unknown http://arxiv.org/abs/1909.03909 Computer Science - Computer Vision and Pattern Recognition Computer Science - Multimedia text 2019 ftarxivpreprints 2023-08-16T15:30:15Z The problem of distance metric learning is mostly considered from the perspective of learning an embedding space, where the distances between pairs of examples are in correspondence with a similarity metric. With the rise and success of Convolutional Neural Networks (CNN), deep metric learning (DML) involves training a network to learn a nonlinear transformation to the embedding space. Existing DML approaches often express the supervision through maximizing inter-class distance and minimizing intra-class variation. However, the results can suffer from overfitting problem, especially when the training examples of each class are embedded together tightly and the density of each class is very high. In this paper, we integrate density, i.e., the measure of data concentration in the representation, into the optimization of DML frameworks to adaptively balance inter-class similarity and intra-class variation by training the architecture in an end-to-end manner. Technically, the knowledge of density is employed as a regularizer, which is pluggable to any DML architecture with different objective functions such as contrastive loss, N-pair loss and triplet loss. Extensive experiments on three public datasets consistently demonstrate clear improvements by amending three types of embedding with the density adaptivity. More remarkably, our proposal increases Recall@1 from 67.95% to 77.62%, from 52.01% to 55.64% and from 68.20% to 70.56% on Cars196, CUB-200-2011 and Stanford Online Products dataset, respectively. Comment: Accepted by IEEE Transactions on Multimedia Text DML ArXiv.org (Cornell University Library) |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Computer Science - Computer Vision and Pattern Recognition Computer Science - Multimedia |
spellingShingle |
Computer Science - Computer Vision and Pattern Recognition Computer Science - Multimedia Li, Yehao Yao, Ting Pan, Yingwei Chao, Hongyang Mei, Tao Deep Metric Learning with Density Adaptivity |
topic_facet |
Computer Science - Computer Vision and Pattern Recognition Computer Science - Multimedia |
description |
The problem of distance metric learning is mostly considered from the perspective of learning an embedding space, where the distances between pairs of examples are in correspondence with a similarity metric. With the rise and success of Convolutional Neural Networks (CNN), deep metric learning (DML) involves training a network to learn a nonlinear transformation to the embedding space. Existing DML approaches often express the supervision through maximizing inter-class distance and minimizing intra-class variation. However, the results can suffer from overfitting problem, especially when the training examples of each class are embedded together tightly and the density of each class is very high. In this paper, we integrate density, i.e., the measure of data concentration in the representation, into the optimization of DML frameworks to adaptively balance inter-class similarity and intra-class variation by training the architecture in an end-to-end manner. Technically, the knowledge of density is employed as a regularizer, which is pluggable to any DML architecture with different objective functions such as contrastive loss, N-pair loss and triplet loss. Extensive experiments on three public datasets consistently demonstrate clear improvements by amending three types of embedding with the density adaptivity. More remarkably, our proposal increases Recall@1 from 67.95% to 77.62%, from 52.01% to 55.64% and from 68.20% to 70.56% on Cars196, CUB-200-2011 and Stanford Online Products dataset, respectively. Comment: Accepted by IEEE Transactions on Multimedia |
format |
Text |
author |
Li, Yehao Yao, Ting Pan, Yingwei Chao, Hongyang Mei, Tao |
author_facet |
Li, Yehao Yao, Ting Pan, Yingwei Chao, Hongyang Mei, Tao |
author_sort |
Li, Yehao |
title |
Deep Metric Learning with Density Adaptivity |
title_short |
Deep Metric Learning with Density Adaptivity |
title_full |
Deep Metric Learning with Density Adaptivity |
title_fullStr |
Deep Metric Learning with Density Adaptivity |
title_full_unstemmed |
Deep Metric Learning with Density Adaptivity |
title_sort |
deep metric learning with density adaptivity |
publishDate |
2019 |
url |
http://arxiv.org/abs/1909.03909 |
genre |
DML |
genre_facet |
DML |
op_relation |
http://arxiv.org/abs/1909.03909 |
_version_ |
1776199895943741440 |