An adaptive dropout based deep metric learning algorithm

The key idea of Deep Metric Learning (DML) is to learn a set of hierarchical non-linear mappings using deep neural networks, and then project the data samples into a new feature space for comparing or matching. As its name suggest, DML is a combination of deep learning and metric learning. Deep lear...

Full description

Bibliographic Details
Main Author:	Tan, Ronald Tay Siang
Other Authors:	Zhang Jie, School of Computer Science and Engineering, ZhangJ@ntu.edu.sg
Format:	Other/Unknown Material
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Pattern recognition DML
Online Access:	https://hdl.handle.net/10356/156649

id	ftnanyangtu:oai:dr.ntu.edu.sg:10356/156649
record_format	openpolar
spelling	ftnanyangtu:oai:dr.ntu.edu.sg:10356/156649 2023-05-15T16:01:10+02:00 An adaptive dropout based deep metric learning algorithm Tan, Ronald Tay Siang Zhang Jie School of Computer Science and Engineering ZhangJ@ntu.edu.sg 2022 application/pdf https://hdl.handle.net/10356/156649 en eng Nanyang Technological University SCSE21-0295 Tan, R. T. S. (2022). An adaptive dropout based deep metric learning algorithm. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156649 https://hdl.handle.net/10356/156649 Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Final Year Project (FYP) 2022 ftnanyangtu 2022-04-29T00:13:22Z The key idea of Deep Metric Learning (DML) is to learn a set of hierarchical non-linear mappings using deep neural networks, and then project the data samples into a new feature space for comparing or matching. As its name suggest, DML is a combination of deep learning and metric learning. Deep learning is a machine learning technique that requires the use of large neural networks. Metric learning is also a machine learning technique that utilizes distances between data points to undergo training and testing. Therefore, DML is a combination of these 2 techniques. DML is theorised to be an effective way to utilize a distance metric to learn the similarity between 2 data samples. This allows the model to predict if the data samples inputted is of the same class or not. This brings us to the idea of contrastive learning. Contrastive learning is a machine learning technique used to learn the general features of a dataset without labels by teaching the model which data points are similar or different. This is very useful in this project as such techniques allow us to train the model without any annotations or labels. This allows our model to transit from a supervised learning task to a self-supervised learning task. This is very beneficial given that the labels of the dataset are not given in a one-to-one relationship. The application of DML has achieved many practical successes. This is due to the many modifications made to the several aspects of DML. This includes the modification of model architecture, optimization of objective function, or addition of new training tasks to improve the training process, etc. The list of improvements made to DML goes on but all have their own practical successes due to the different dataset that the algorithm is applied on. Naturally, a different dataset would require a different optimization for the model to fully understand the data points. As many types of modifications and optimizations made to DML, there has not been any existing work that theoretically analyses the generalization error bound for DML. This can be a good method to measure how good a learned DML model is able to perform on unseen data. Therefore, the novel DML method, Adaptive Dropout based DML (ADroDML), is proposed which can adaptively learn the retention rates for the DML models with dropout in a theoretically justified way. This is compared to traditional DML methods of predefined retention rates that are unchanged throughout the training process. ADroDML can theoretically learn the retention rates in an optimal way and update them iteratively to achieve better performance. Experiments on the given dataset were made and found that the algorithm was able to produce a substantial accuracy score, comparable to the results produced in the research paper proposing ADroDML [1]. The algorithm is also able to do predictions to produce similar results on unseen data when tested on other months data. Therefore, this paper proves the success of the proposed ADroDML algorithm when applied on this given dataset. Bachelor of Engineering (Computer Science) Other/Unknown Material DML DR-NTU (Digital Repository at Nanyang Technological University, Singapore)
institution	Open Polar
collection	DR-NTU (Digital Repository at Nanyang Technological University, Singapore)
op_collection_id	ftnanyangtu
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Tan, Ronald Tay Siang An adaptive dropout based deep metric learning algorithm
topic_facet	Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
description	The key idea of Deep Metric Learning (DML) is to learn a set of hierarchical non-linear mappings using deep neural networks, and then project the data samples into a new feature space for comparing or matching. As its name suggest, DML is a combination of deep learning and metric learning. Deep learning is a machine learning technique that requires the use of large neural networks. Metric learning is also a machine learning technique that utilizes distances between data points to undergo training and testing. Therefore, DML is a combination of these 2 techniques. DML is theorised to be an effective way to utilize a distance metric to learn the similarity between 2 data samples. This allows the model to predict if the data samples inputted is of the same class or not. This brings us to the idea of contrastive learning. Contrastive learning is a machine learning technique used to learn the general features of a dataset without labels by teaching the model which data points are similar or different. This is very useful in this project as such techniques allow us to train the model without any annotations or labels. This allows our model to transit from a supervised learning task to a self-supervised learning task. This is very beneficial given that the labels of the dataset are not given in a one-to-one relationship. The application of DML has achieved many practical successes. This is due to the many modifications made to the several aspects of DML. This includes the modification of model architecture, optimization of objective function, or addition of new training tasks to improve the training process, etc. The list of improvements made to DML goes on but all have their own practical successes due to the different dataset that the algorithm is applied on. Naturally, a different dataset would require a different optimization for the model to fully understand the data points. As many types of modifications and optimizations made to DML, there has not been any existing work that theoretically analyses the generalization error bound for DML. This can be a good method to measure how good a learned DML model is able to perform on unseen data. Therefore, the novel DML method, Adaptive Dropout based DML (ADroDML), is proposed which can adaptively learn the retention rates for the DML models with dropout in a theoretically justified way. This is compared to traditional DML methods of predefined retention rates that are unchanged throughout the training process. ADroDML can theoretically learn the retention rates in an optimal way and update them iteratively to achieve better performance. Experiments on the given dataset were made and found that the algorithm was able to produce a substantial accuracy score, comparable to the results produced in the research paper proposing ADroDML [1]. The algorithm is also able to do predictions to produce similar results on unseen data when tested on other months data. Therefore, this paper proves the success of the proposed ADroDML algorithm when applied on this given dataset. Bachelor of Engineering (Computer Science)
author2	Zhang Jie School of Computer Science and Engineering ZhangJ@ntu.edu.sg
format	Other/Unknown Material
author	Tan, Ronald Tay Siang
author_facet	Tan, Ronald Tay Siang
author_sort	Tan, Ronald Tay Siang
title	An adaptive dropout based deep metric learning algorithm
title_short	An adaptive dropout based deep metric learning algorithm
title_full	An adaptive dropout based deep metric learning algorithm
title_fullStr	An adaptive dropout based deep metric learning algorithm
title_full_unstemmed	An adaptive dropout based deep metric learning algorithm
title_sort	adaptive dropout based deep metric learning algorithm
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/156649
genre	DML
genre_facet	DML
op_relation	SCSE21-0295 Tan, R. T. S. (2022). An adaptive dropout based deep metric learning algorithm. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156649 https://hdl.handle.net/10356/156649
_version_	1766397139735281664

An adaptive dropout based deep metric learning algorithm

Similar Items