Online multi-modal distance metric learning with application to image retrieval

Distance metric learning (DML) is an important technique to improve similarity search in content-based image retrieval. Despite being studied extensively, most existing DML approaches typically adopt a single-modal learning framework that learns the distance metric on either a single feature type or...

Full description

Bibliographic Details
Main Authors: WU, Pengcheng, HOI, Steven C. H., ZHAO, Peilin, MIAO, Chunyan, LIU, Zhi-Yong
Format: Text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
DML
Online Access:https://ink.library.smu.edu.sg/sis_research/2924
https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=3924&context=sis_research
id ftsingaporemuniv:oai:ink.library.smu.edu.sg:sis_research-3924
record_format openpolar
spelling ftsingaporemuniv:oai:ink.library.smu.edu.sg:sis_research-3924 2023-05-15T16:01:19+02:00 Online multi-modal distance metric learning with application to image retrieval WU, Pengcheng HOI, Steven C. H. ZHAO, Peilin MIAO, Chunyan LIU, Zhi-Yong 2016-02-01T08:00:00Z application/pdf https://ink.library.smu.edu.sg/sis_research/2924 https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=3924&context=sis_research eng eng Institutional Knowledge at Singapore Management University https://ink.library.smu.edu.sg/sis_research/2924 https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=3924&context=sis_research http://creativecommons.org/licenses/by-nc-nd/4.0/ CC-BY-NC-ND Research Collection School Of Computing and Information Systems content-based image retrieval multi-modal retrieval distance metric learning online learning Databases and Information Systems text 2016 ftsingaporemuniv 2021-08-31T17:38:45Z Distance metric learning (DML) is an important technique to improve similarity search in content-based image retrieval. Despite being studied extensively, most existing DML approaches typically adopt a single-modal learning framework that learns the distance metric on either a single feature type or a combined feature space where multiple types of features are simply concatenated. Such single-modal DML methods suffer from some critical limitations: (i) some type of features may significantly dominate the others in the DML task due to diverse feature representations; and (ii) learning a distance metric on the combined high-dimensional feature space can be extremely time-consuming using the naive feature concatenation approach. To address these limitations, in this paper, we investigate a novel scheme of online multi-modal distance metric learning (OMDML), which explores a unified two-level online learning scheme: (i) it learns to optimize a distance metric on each individual feature space; and (ii) then it learns to find the optimal combination of diverse types of features. To further reduce the expensive cost of DML on high-dimensional feature space, we propose a low-rank OMDML algorithm which not only significantly reduces the computational cost but also retains highly competing or even better learning accuracy. We conduct extensive experiments to evaluate the performance of the proposed algorithms for multi-modal image retrieval, in which encouraging results validate the effectiveness of the proposed technique. Text DML Institutional Knowledge (InK) at Singapore Management University
institution Open Polar
collection Institutional Knowledge (InK) at Singapore Management University
op_collection_id ftsingaporemuniv
language English
topic content-based image retrieval
multi-modal retrieval
distance metric learning
online learning
Databases and Information Systems
spellingShingle content-based image retrieval
multi-modal retrieval
distance metric learning
online learning
Databases and Information Systems
WU, Pengcheng
HOI, Steven C. H.
ZHAO, Peilin
MIAO, Chunyan
LIU, Zhi-Yong
Online multi-modal distance metric learning with application to image retrieval
topic_facet content-based image retrieval
multi-modal retrieval
distance metric learning
online learning
Databases and Information Systems
description Distance metric learning (DML) is an important technique to improve similarity search in content-based image retrieval. Despite being studied extensively, most existing DML approaches typically adopt a single-modal learning framework that learns the distance metric on either a single feature type or a combined feature space where multiple types of features are simply concatenated. Such single-modal DML methods suffer from some critical limitations: (i) some type of features may significantly dominate the others in the DML task due to diverse feature representations; and (ii) learning a distance metric on the combined high-dimensional feature space can be extremely time-consuming using the naive feature concatenation approach. To address these limitations, in this paper, we investigate a novel scheme of online multi-modal distance metric learning (OMDML), which explores a unified two-level online learning scheme: (i) it learns to optimize a distance metric on each individual feature space; and (ii) then it learns to find the optimal combination of diverse types of features. To further reduce the expensive cost of DML on high-dimensional feature space, we propose a low-rank OMDML algorithm which not only significantly reduces the computational cost but also retains highly competing or even better learning accuracy. We conduct extensive experiments to evaluate the performance of the proposed algorithms for multi-modal image retrieval, in which encouraging results validate the effectiveness of the proposed technique.
format Text
author WU, Pengcheng
HOI, Steven C. H.
ZHAO, Peilin
MIAO, Chunyan
LIU, Zhi-Yong
author_facet WU, Pengcheng
HOI, Steven C. H.
ZHAO, Peilin
MIAO, Chunyan
LIU, Zhi-Yong
author_sort WU, Pengcheng
title Online multi-modal distance metric learning with application to image retrieval
title_short Online multi-modal distance metric learning with application to image retrieval
title_full Online multi-modal distance metric learning with application to image retrieval
title_fullStr Online multi-modal distance metric learning with application to image retrieval
title_full_unstemmed Online multi-modal distance metric learning with application to image retrieval
title_sort online multi-modal distance metric learning with application to image retrieval
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/2924
https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=3924&context=sis_research
genre DML
genre_facet DML
op_source Research Collection School Of Computing and Information Systems
op_relation https://ink.library.smu.edu.sg/sis_research/2924
https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=3924&context=sis_research
op_rights http://creativecommons.org/licenses/by-nc-nd/4.0/
op_rightsnorm CC-BY-NC-ND
_version_ 1766397228550717440