DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning

Visual Similarity plays an important role in many computer vision applications. Deep metric learning (DML) is a powerful framework for learning such similarities which not only generalize from training data to identically distributed test distributions, but in particular also translate to unknown te...

Full description

Bibliographic Details
Main Authors:	Milbich, Timo, Roth, Karsten, Bharadhwaj, Homanga, Sinha, Samarth, Bengio, Yoshua, Ommer, Björn, Cohen, Joseph Paul
Format:	Text
Language:	unknown
Published:	2020
Subjects:	Computer Science - Computer Vision and Pattern Recognition DML
Online Access:	http://arxiv.org/abs/2004.13458

id	ftarxivpreprints:oai:arXiv.org:2004.13458
record_format	openpolar
spelling	ftarxivpreprints:oai:arXiv.org:2004.13458 2023-09-05T13:19:05+02:00 DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning Milbich, Timo Roth, Karsten Bharadhwaj, Homanga Sinha, Samarth Bengio, Yoshua Ommer, Björn Cohen, Joseph Paul 2020-04-28 http://arxiv.org/abs/2004.13458 unknown http://arxiv.org/abs/2004.13458 Computer Science - Computer Vision and Pattern Recognition text 2020 ftarxivpreprints 2023-08-16T15:51:18Z Visual Similarity plays an important role in many computer vision applications. Deep metric learning (DML) is a powerful framework for learning such similarities which not only generalize from training data to identically distributed test distributions, but in particular also translate to unknown test classes. However, its prevailing learning paradigm is class-discriminative supervised training, which typically results in representations specialized in separating training classes. For effective generalization, however, such an image representation needs to capture a diverse range of data characteristics. To this end, we propose and study multiple complementary learning tasks, targeting conceptually different data relationships by only resorting to the available training samples and labels of a standard DML setting. Through simultaneous optimization of our tasks we learn a single model to aggregate their training signals, resulting in strong generalization and state-of-the-art performance on multiple established DML benchmark datasets. Comment: published at ECCV 2020 Text DML ArXiv.org (Cornell University Library)
institution	Open Polar
collection	ArXiv.org (Cornell University Library)
op_collection_id	ftarxivpreprints
language	unknown
topic	Computer Science - Computer Vision and Pattern Recognition
spellingShingle	Computer Science - Computer Vision and Pattern Recognition Milbich, Timo Roth, Karsten Bharadhwaj, Homanga Sinha, Samarth Bengio, Yoshua Ommer, Björn Cohen, Joseph Paul DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning
topic_facet	Computer Science - Computer Vision and Pattern Recognition
description	Visual Similarity plays an important role in many computer vision applications. Deep metric learning (DML) is a powerful framework for learning such similarities which not only generalize from training data to identically distributed test distributions, but in particular also translate to unknown test classes. However, its prevailing learning paradigm is class-discriminative supervised training, which typically results in representations specialized in separating training classes. For effective generalization, however, such an image representation needs to capture a diverse range of data characteristics. To this end, we propose and study multiple complementary learning tasks, targeting conceptually different data relationships by only resorting to the available training samples and labels of a standard DML setting. Through simultaneous optimization of our tasks we learn a single model to aggregate their training signals, resulting in strong generalization and state-of-the-art performance on multiple established DML benchmark datasets. Comment: published at ECCV 2020
format	Text
author	Milbich, Timo Roth, Karsten Bharadhwaj, Homanga Sinha, Samarth Bengio, Yoshua Ommer, Björn Cohen, Joseph Paul
author_facet	Milbich, Timo Roth, Karsten Bharadhwaj, Homanga Sinha, Samarth Bengio, Yoshua Ommer, Björn Cohen, Joseph Paul
author_sort	Milbich, Timo
title	DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning
title_short	DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning
title_full	DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning
title_fullStr	DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning
title_full_unstemmed	DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning
title_sort	diva: diverse visual feature aggregation for deep metric learning
publishDate	2020
url	http://arxiv.org/abs/2004.13458
genre	DML
genre_facet	DML
op_relation	http://arxiv.org/abs/2004.13458
_version_	1776199892958445568

DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning

Similar Items