Metrics Space and Norm: Taxonomy to Distance Metrics

Bibliographic Details
Published in: Scientific Programming
Main Authors: Barathi Subramanian, Anand Paul, Jeonghong Kim, K.-W.-A. Chee
Format: Article in Journal/Newspaper
Language: English
Published: Hindawi Limited 2022
Subjects:
DML
Online Access: https://doi.org/10.1155/2022/1911345
https://doaj.org/article/3889d72ad42b4815b0c86b83773ee22a
Description
Summary: Many machine learning algorithms, including neighbor-based methods such as K-nearest neighbor (KNN), depend heavily on distance metrics to capture patterns in the data and to make sound decisions from it. Recent studies show that a well-chosen distance metric can significantly improve the performance of machine learning and deep learning models in tasks such as clustering, classification, and data recovery. In this article, we survey widely used distance metrics and the challenges associated with this field. Most current studies in this area are influenced by Siamese and triplet networks, which relate samples to one another using shared weights in deep metric learning (DML). These networks succeed because they can recognize relationships among similar samples. The sampling strategy, the choice of distance metric, and the network structure are all complex factors that researchers must address to improve model performance. This article is therefore significant as the most recent detailed survey in which these components are examined comprehensively and evaluated as a whole, supported by an assessment of the numerical results of the techniques.