Content-based Information Retrieval via Nearest Neighbor Search

Content-based information retrieval (CBIR) has attracted significant interest in the past few years. When given a search query, the search engine will compare the query with all the stored information in the database through nearest neighbor search. Finally, the system will return the most similar i...

Full description

Bibliographic Details
Other Authors: Huang, Yinjie (Author), Georgiopoulos, Michael (Committee Chair), Anagnostopoulos, Georgios (Committee CoChair), Hu, Haiyan (Committee Member), Sukthankar, Gita (Committee Member), Ni, Liqiang (Committee Member), University of Central Florida (Degree Grantor)
Format: Text
Language:English
Published: University of Central Florida
Subjects:
DML
Online Access:http://purl.flvc.org/ucf/fd/CFE0006327
id ftucentralflordl:oai:ucf.digital.flvc.org:ucf_51544
record_format openpolar
spelling ftucentralflordl:oai:ucf.digital.flvc.org:ucf_51544 2023-11-12T04:16:28+01:00 Content-based Information Retrieval via Nearest Neighbor Search Huang, Yinjie (Author) Georgiopoulos, Michael (Committee Chair) Anagnostopoulos, Georgios (Committee CoChair) Hu, Haiyan (Committee Member) Sukthankar, Gita (Committee Member) Ni, Liqiang (Committee Member) University of Central Florida (Degree Grantor) http://purl.flvc.org/ucf/fd/CFE0006327 English eng University of Central Florida CFE0006327 ucf:51544 http://purl.flvc.org/ucf/fd/CFE0006327 public 2016-08-15 Content-based Information Retrieval--Metric Learning--Hash Function Learning Text ftucentralflordl 2023-10-24T16:35:50Z Content-based information retrieval (CBIR) has attracted significant interest in the past few years. When given a search query, the search engine will compare the query with all the stored information in the database through nearest neighbor search. Finally, the system will return the most similar items. We contribute to the CBIR research the following: firstly, Distance Metric Learning (DML) is studied to improve retrieval accuracy of nearest neighbor search. Additionally, Hash Function Learning (HFL) is considered to accelerate the retrieval process.On one hand, a new local metric learning framework is proposed - Reduced-Rank Local Metric Learning (R2LML). By considering a conical combination of Mahalanobis metrics, the proposed method is able to better capture information like data's similarity and location. A regularization to suppress the noise and avoid over-fitting is also incorporated into the formulation. Based on the different methods to infer the weights for the local metric, we considered two frameworks: Transductive Reduced-Rank Local Metric Learning (T-R2LML), which utilizes transductive learning, while Efficient Reduced-Rank Local Metric Learning (E-R2LML)employs a simpler and faster approximated method. Besides, we study the convergence property of the proposed block coordinate descent algorithms for both our frameworks. The extensive experiments show the superiority of our approaches.On the other hand, *Supervised Hash Learning (*SHL), which could be used in supervised, semi-supervised and unsupervised learning scenarios, was proposed in the dissertation. By considering several codewords which could be learned from the data, the proposed method naturally derives to several Support Vector Machine (SVM) problems. After providing an efficient training algorithm, we also study the theoretical generalization bound of the new hashing framework. In the final experiments, *SHL outperforms many other popular hash function learning methods. Additionally, in order to cope with large data sets, we also ... Text DML UCF Digital Collections (University of Central Florida)
institution Open Polar
collection UCF Digital Collections (University of Central Florida)
op_collection_id ftucentralflordl
language English
topic Content-based Information Retrieval--Metric Learning--Hash Function Learning
spellingShingle Content-based Information Retrieval--Metric Learning--Hash Function Learning
Content-based Information Retrieval via Nearest Neighbor Search
topic_facet Content-based Information Retrieval--Metric Learning--Hash Function Learning
description Content-based information retrieval (CBIR) has attracted significant interest in the past few years. When given a search query, the search engine will compare the query with all the stored information in the database through nearest neighbor search. Finally, the system will return the most similar items. We contribute to the CBIR research the following: firstly, Distance Metric Learning (DML) is studied to improve retrieval accuracy of nearest neighbor search. Additionally, Hash Function Learning (HFL) is considered to accelerate the retrieval process.On one hand, a new local metric learning framework is proposed - Reduced-Rank Local Metric Learning (R2LML). By considering a conical combination of Mahalanobis metrics, the proposed method is able to better capture information like data's similarity and location. A regularization to suppress the noise and avoid over-fitting is also incorporated into the formulation. Based on the different methods to infer the weights for the local metric, we considered two frameworks: Transductive Reduced-Rank Local Metric Learning (T-R2LML), which utilizes transductive learning, while Efficient Reduced-Rank Local Metric Learning (E-R2LML)employs a simpler and faster approximated method. Besides, we study the convergence property of the proposed block coordinate descent algorithms for both our frameworks. The extensive experiments show the superiority of our approaches.On the other hand, *Supervised Hash Learning (*SHL), which could be used in supervised, semi-supervised and unsupervised learning scenarios, was proposed in the dissertation. By considering several codewords which could be learned from the data, the proposed method naturally derives to several Support Vector Machine (SVM) problems. After providing an efficient training algorithm, we also study the theoretical generalization bound of the new hashing framework. In the final experiments, *SHL outperforms many other popular hash function learning methods. Additionally, in order to cope with large data sets, we also ...
author2 Huang, Yinjie (Author)
Georgiopoulos, Michael (Committee Chair)
Anagnostopoulos, Georgios (Committee CoChair)
Hu, Haiyan (Committee Member)
Sukthankar, Gita (Committee Member)
Ni, Liqiang (Committee Member)
University of Central Florida (Degree Grantor)
format Text
title Content-based Information Retrieval via Nearest Neighbor Search
title_short Content-based Information Retrieval via Nearest Neighbor Search
title_full Content-based Information Retrieval via Nearest Neighbor Search
title_fullStr Content-based Information Retrieval via Nearest Neighbor Search
title_full_unstemmed Content-based Information Retrieval via Nearest Neighbor Search
title_sort content-based information retrieval via nearest neighbor search
publisher University of Central Florida
url http://purl.flvc.org/ucf/fd/CFE0006327
genre DML
genre_facet DML
op_relation CFE0006327
ucf:51544
http://purl.flvc.org/ucf/fd/CFE0006327
op_rights public 2016-08-15
_version_ 1782333545268641792