Image Retrieval Based on Learning to Rank and Multiple Loss

Image retrieval applying deep convolutional features has achieved the most advanced performance in most standard benchmark tests. In image retrieval, deep metric learning (DML) plays a key role and aims to capture semantic similarity information carried by data points. However, two factors may imped...

Full description

Bibliographic Details
Published in:ISPRS International Journal of Geo-Information
Main Authors: Lili Fan, Hongwei Zhao, Haoyu Zhao, Pingping Liu, Huangshui Hu
Format: Article in Journal/Newspaper
Language:English
Published: MDPI AG 2019
Subjects:
DML
Online Access:https://doi.org/10.3390/ijgi8090393
https://doaj.org/article/245a2f35d37242a599a15c464c642d79
Description
Summary:Image retrieval applying deep convolutional features has achieved the most advanced performance in most standard benchmark tests. In image retrieval, deep metric learning (DML) plays a key role and aims to capture semantic similarity information carried by data points. However, two factors may impede the accuracy of image retrieval. First, when learning the similarity of negative examples, current methods separate negative pairs into equal distance in the embedding space. Thus, the intraclass data distribution might be missed. Second, given a query, either a fraction of data points, or all of them, are incorporated to build up the similarity structure, which makes it rather complex to calculate similarity or to choose example pairs. In this study, in order to achieve more accurate image retrieval, we proposed a method based on learning to rank and multiple loss (LRML). To address the first problem, through learning the ranking sequence, we separate the negative pairs from the query image into different distance. To tackle the second problem, we used a positive example in the gallery and negative sets from the bottom five ranked by similarity, thereby enhancing training efficiency. Our significant experimental results demonstrate that the proposed method achieves state-of-the-art performance on three widely used benchmarks.