Representation learning for domain adaptation and cross-modal retrieval

Most machine learning applications involve a domain shift between data on which a model has initially been trained and data from a similar but different domain to which the model is later applied on. Applications range from human computer interaction (e.g., humans with different characteristics for...

Full description

Bibliographic Details
Main Author:	Ott, Felix
Format:	Thesis
Language:	unknown
Published:	Ludwig-Maximilians-Universität München 2023
Subjects:	Fakultät für Mathematik Informatik und Statistik ddc:000 ddc:004 DML
Online Access:	https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf http://nbn-resolving.de/urn:nbn:de:bvb:19-324674

id	ftmuenchenedoc:oai:edoc.ub.uni-muenchen.de:32467
record_format	openpolar
spelling	ftmuenchenedoc:oai:edoc.ub.uni-muenchen.de:32467 2023-11-05T03:41:36+01:00 Representation learning for domain adaptation and cross-modal retrieval Ott, Felix 2023-07-18 application/pdf https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf http://nbn-resolving.de/urn:nbn:de:bvb:19-324674 unknown Ludwig-Maximilians-Universität München https://edoc.ub.uni-muenchen.de/32467/ https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf http://nbn-resolving.de/urn:nbn:de:bvb:19-324674 Ott, Felix (2023): Representation learning for domain adaptation and cross-modal retrieval: in the context of online handwriting recognition and visual self-localization. Dissertation, LMU München: Fakultät für Mathematik, Informatik und Statistik Fakultät für Mathematik Informatik und Statistik ddc:000 ddc:004 Dissertationen NonPeerReviewed 2023 ftmuenchenedoc 2023-10-08T23:38:45Z Most machine learning applications involve a domain shift between data on which a model has initially been trained and data from a similar but different domain to which the model is later applied on. Applications range from human computer interaction (e.g., humans with different characteristics for speech or handwriting recognition), computer vision (e.g., a change of weather conditions or objects in the environment for visual self-localization), and neural language processing (e.g., switching between different languages). Another related field is cross-modal retrieval, which aims to efficiently extract information from various modalities. In this field, the data can exhibit variations between each modality. Such variations in data between the modalities can negatively impact the performance of the model. To reduce the impact of domain shift, methods search for an optimal transformation from the source to the target domain or an optimal alignment of modalities to learn a domain-invariant representation that is not affected by domain differences. The alignment of features of various data sources that are affected by domain shift requires representation learning techniques. These techniques are used to learn a meaningful representation that can be interpreted, or that includes latent features through the use of deep metric learning (DML). DML minimizes the distance between features by using the standard Euclidean loss, maximizes the similarity of features through cross correlation, or decreases the discrepancy of higher-order statistics like the maximum mean discrepancy. A similar but distinct field is pairwise learning and contrastive learning, which also employs DML. Contrastive learning not only aligns the features of data input pairs that have the same class label, but also increases the distance between pairs that have similar but different labels, thus enhancing the training process. This research presents techniques for domain adaptation and cross-modal retrieval that specifically focus on the following two ... Thesis DML Electronic Theses of LMU Munich (Ludwig-Maximilians-Universität)
institution	Open Polar
collection	Electronic Theses of LMU Munich (Ludwig-Maximilians-Universität)
op_collection_id	ftmuenchenedoc
language	unknown
topic	Fakultät für Mathematik Informatik und Statistik ddc:000 ddc:004
spellingShingle	Fakultät für Mathematik Informatik und Statistik ddc:000 ddc:004 Ott, Felix Representation learning for domain adaptation and cross-modal retrieval
topic_facet	Fakultät für Mathematik Informatik und Statistik ddc:000 ddc:004
description	Most machine learning applications involve a domain shift between data on which a model has initially been trained and data from a similar but different domain to which the model is later applied on. Applications range from human computer interaction (e.g., humans with different characteristics for speech or handwriting recognition), computer vision (e.g., a change of weather conditions or objects in the environment for visual self-localization), and neural language processing (e.g., switching between different languages). Another related field is cross-modal retrieval, which aims to efficiently extract information from various modalities. In this field, the data can exhibit variations between each modality. Such variations in data between the modalities can negatively impact the performance of the model. To reduce the impact of domain shift, methods search for an optimal transformation from the source to the target domain or an optimal alignment of modalities to learn a domain-invariant representation that is not affected by domain differences. The alignment of features of various data sources that are affected by domain shift requires representation learning techniques. These techniques are used to learn a meaningful representation that can be interpreted, or that includes latent features through the use of deep metric learning (DML). DML minimizes the distance between features by using the standard Euclidean loss, maximizes the similarity of features through cross correlation, or decreases the discrepancy of higher-order statistics like the maximum mean discrepancy. A similar but distinct field is pairwise learning and contrastive learning, which also employs DML. Contrastive learning not only aligns the features of data input pairs that have the same class label, but also increases the distance between pairs that have similar but different labels, thus enhancing the training process. This research presents techniques for domain adaptation and cross-modal retrieval that specifically focus on the following two ...
format	Thesis
author	Ott, Felix
author_facet	Ott, Felix
author_sort	Ott, Felix
title	Representation learning for domain adaptation and cross-modal retrieval
title_short	Representation learning for domain adaptation and cross-modal retrieval
title_full	Representation learning for domain adaptation and cross-modal retrieval
title_fullStr	Representation learning for domain adaptation and cross-modal retrieval
title_full_unstemmed	Representation learning for domain adaptation and cross-modal retrieval
title_sort	representation learning for domain adaptation and cross-modal retrieval
publisher	Ludwig-Maximilians-Universität München
publishDate	2023
url	https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf http://nbn-resolving.de/urn:nbn:de:bvb:19-324674
genre	DML
genre_facet	DML
op_relation	https://edoc.ub.uni-muenchen.de/32467/ https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf http://nbn-resolving.de/urn:nbn:de:bvb:19-324674 Ott, Felix (2023): Representation learning for domain adaptation and cross-modal retrieval: in the context of online handwriting recognition and visual self-localization. Dissertation, LMU München: Fakultät für Mathematik, Informatik und Statistik
_version_	1781698046816419840

Representation learning for domain adaptation and cross-modal retrieval

Similar Items