Rethinking Deep Contrastive Learning with Embedding Memory
Pair-wise loss functions have been extensively studied and shown to continuously improve the performance of deep metric learning (DML). However, they are primarily designed with intuition based on simple toy examples, and experimentally identifying the truly effective design is difficult in complica...
Main Authors: | , , , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
2021
|
Subjects: | |
Online Access: | http://arxiv.org/abs/2103.14003 |
id |
ftarxivpreprints:oai:arXiv.org:2103.14003 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:2103.14003 2023-09-05T13:19:05+02:00 Rethinking Deep Contrastive Learning with Embedding Memory Zhang, Haozhi Wang, Xun Huang, Weilin Scott, Matthew R. 2021-03-25 http://arxiv.org/abs/2103.14003 unknown http://arxiv.org/abs/2103.14003 Computer Science - Computer Vision and Pattern Recognition text 2021 ftarxivpreprints 2023-08-16T16:24:46Z Pair-wise loss functions have been extensively studied and shown to continuously improve the performance of deep metric learning (DML). However, they are primarily designed with intuition based on simple toy examples, and experimentally identifying the truly effective design is difficult in complicated, real-world cases. In this paper, we provide a new methodology for systematically studying weighting strategies of various pair-wise loss functions, and rethink pair weighting with an embedding memory. We delve into the weighting mechanisms by decomposing the pair-wise functions, and study positive and negative weights separately using direct weight assignment. This allows us to study various weighting functions deeply and systematically via weight curves, and identify a number of meaningful, comprehensive and insightful facts, which come up with our key observation on memory-based DML: it is critical to mine hard negatives and discard easy negatives which are less informative and redundant, but weighting on positive pairs is not helpful. This results in an efficient but surprisingly simple rule to design the weighting scheme, making it significantly different from existing mini-batch based methods which design various sophisticated loss functions to weight pairs carefully. Finally, we conduct extensive experiments on three large-scale visual retrieval benchmarks, and demonstrate the superiority of memory-based DML over recent mini-batch based approaches, by using a simple contrastive loss with momentum-updated memory. Comment: Under review Text DML ArXiv.org (Cornell University Library) |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Computer Science - Computer Vision and Pattern Recognition |
spellingShingle |
Computer Science - Computer Vision and Pattern Recognition Zhang, Haozhi Wang, Xun Huang, Weilin Scott, Matthew R. Rethinking Deep Contrastive Learning with Embedding Memory |
topic_facet |
Computer Science - Computer Vision and Pattern Recognition |
description |
Pair-wise loss functions have been extensively studied and shown to continuously improve the performance of deep metric learning (DML). However, they are primarily designed with intuition based on simple toy examples, and experimentally identifying the truly effective design is difficult in complicated, real-world cases. In this paper, we provide a new methodology for systematically studying weighting strategies of various pair-wise loss functions, and rethink pair weighting with an embedding memory. We delve into the weighting mechanisms by decomposing the pair-wise functions, and study positive and negative weights separately using direct weight assignment. This allows us to study various weighting functions deeply and systematically via weight curves, and identify a number of meaningful, comprehensive and insightful facts, which come up with our key observation on memory-based DML: it is critical to mine hard negatives and discard easy negatives which are less informative and redundant, but weighting on positive pairs is not helpful. This results in an efficient but surprisingly simple rule to design the weighting scheme, making it significantly different from existing mini-batch based methods which design various sophisticated loss functions to weight pairs carefully. Finally, we conduct extensive experiments on three large-scale visual retrieval benchmarks, and demonstrate the superiority of memory-based DML over recent mini-batch based approaches, by using a simple contrastive loss with momentum-updated memory. Comment: Under review |
format |
Text |
author |
Zhang, Haozhi Wang, Xun Huang, Weilin Scott, Matthew R. |
author_facet |
Zhang, Haozhi Wang, Xun Huang, Weilin Scott, Matthew R. |
author_sort |
Zhang, Haozhi |
title |
Rethinking Deep Contrastive Learning with Embedding Memory |
title_short |
Rethinking Deep Contrastive Learning with Embedding Memory |
title_full |
Rethinking Deep Contrastive Learning with Embedding Memory |
title_fullStr |
Rethinking Deep Contrastive Learning with Embedding Memory |
title_full_unstemmed |
Rethinking Deep Contrastive Learning with Embedding Memory |
title_sort |
rethinking deep contrastive learning with embedding memory |
publishDate |
2021 |
url |
http://arxiv.org/abs/2103.14003 |
genre |
DML |
genre_facet |
DML |
op_relation |
http://arxiv.org/abs/2103.14003 |
_version_ |
1776199901816815616 |