Representation learning for domain adaptation and cross-modal retrieval

Most machine learning applications involve a domain shift between data on which a model has initially been trained and data from a similar but different domain to which the model is later applied on. Applications range from human computer interaction (e.g., humans with different characteristics for...

Full description

Bibliographic Details
Main Author: Ott, Felix
Format: Thesis
Language:unknown
Published: Ludwig-Maximilians-Universität München 2023
Subjects:
DML
Online Access:https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf
http://nbn-resolving.de/urn:nbn:de:bvb:19-324674
id ftmuenchenedoc:oai:edoc.ub.uni-muenchen.de:32467
record_format openpolar
spelling ftmuenchenedoc:oai:edoc.ub.uni-muenchen.de:32467 2023-11-05T03:41:36+01:00 Representation learning for domain adaptation and cross-modal retrieval Ott, Felix 2023-07-18 application/pdf https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf http://nbn-resolving.de/urn:nbn:de:bvb:19-324674 unknown Ludwig-Maximilians-Universität München https://edoc.ub.uni-muenchen.de/32467/ https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf http://nbn-resolving.de/urn:nbn:de:bvb:19-324674 Ott, Felix (2023): Representation learning for domain adaptation and cross-modal retrieval: in the context of online handwriting recognition and visual self-localization. Dissertation, LMU München: Fakultät für Mathematik, Informatik und Statistik Fakultät für Mathematik Informatik und Statistik ddc:000 ddc:004 Dissertationen NonPeerReviewed 2023 ftmuenchenedoc 2023-10-08T23:38:45Z Most machine learning applications involve a domain shift between data on which a model has initially been trained and data from a similar but different domain to which the model is later applied on. Applications range from human computer interaction (e.g., humans with different characteristics for speech or handwriting recognition), computer vision (e.g., a change of weather conditions or objects in the environment for visual self-localization), and neural language processing (e.g., switching between different languages). Another related field is cross-modal retrieval, which aims to efficiently extract information from various modalities. In this field, the data can exhibit variations between each modality. Such variations in data between the modalities can negatively impact the performance of the model. To reduce the impact of domain shift, methods search for an optimal transformation from the source to the target domain or an optimal alignment of modalities to learn a domain-invariant representation that is not affected by domain differences. The alignment of features of various data sources that are affected by domain shift requires representation learning techniques. These techniques are used to learn a meaningful representation that can be interpreted, or that includes latent features through the use of deep metric learning (DML). DML minimizes the distance between features by using the standard Euclidean loss, maximizes the similarity of features through cross correlation, or decreases the discrepancy of higher-order statistics like the maximum mean discrepancy. A similar but distinct field is pairwise learning and contrastive learning, which also employs DML. Contrastive learning not only aligns the features of data input pairs that have the same class label, but also increases the distance between pairs that have similar but different labels, thus enhancing the training process. This research presents techniques for domain adaptation and cross-modal retrieval that specifically focus on the following two ... Thesis DML Electronic Theses of LMU Munich (Ludwig-Maximilians-Universität)
institution Open Polar
collection Electronic Theses of LMU Munich (Ludwig-Maximilians-Universität)
op_collection_id ftmuenchenedoc
language unknown
topic Fakultät für Mathematik
Informatik und Statistik
ddc:000
ddc:004
spellingShingle Fakultät für Mathematik
Informatik und Statistik
ddc:000
ddc:004
Ott, Felix
Representation learning for domain adaptation and cross-modal retrieval
topic_facet Fakultät für Mathematik
Informatik und Statistik
ddc:000
ddc:004
description Most machine learning applications involve a domain shift between data on which a model has initially been trained and data from a similar but different domain to which the model is later applied on. Applications range from human computer interaction (e.g., humans with different characteristics for speech or handwriting recognition), computer vision (e.g., a change of weather conditions or objects in the environment for visual self-localization), and neural language processing (e.g., switching between different languages). Another related field is cross-modal retrieval, which aims to efficiently extract information from various modalities. In this field, the data can exhibit variations between each modality. Such variations in data between the modalities can negatively impact the performance of the model. To reduce the impact of domain shift, methods search for an optimal transformation from the source to the target domain or an optimal alignment of modalities to learn a domain-invariant representation that is not affected by domain differences. The alignment of features of various data sources that are affected by domain shift requires representation learning techniques. These techniques are used to learn a meaningful representation that can be interpreted, or that includes latent features through the use of deep metric learning (DML). DML minimizes the distance between features by using the standard Euclidean loss, maximizes the similarity of features through cross correlation, or decreases the discrepancy of higher-order statistics like the maximum mean discrepancy. A similar but distinct field is pairwise learning and contrastive learning, which also employs DML. Contrastive learning not only aligns the features of data input pairs that have the same class label, but also increases the distance between pairs that have similar but different labels, thus enhancing the training process. This research presents techniques for domain adaptation and cross-modal retrieval that specifically focus on the following two ...
format Thesis
author Ott, Felix
author_facet Ott, Felix
author_sort Ott, Felix
title Representation learning for domain adaptation and cross-modal retrieval
title_short Representation learning for domain adaptation and cross-modal retrieval
title_full Representation learning for domain adaptation and cross-modal retrieval
title_fullStr Representation learning for domain adaptation and cross-modal retrieval
title_full_unstemmed Representation learning for domain adaptation and cross-modal retrieval
title_sort representation learning for domain adaptation and cross-modal retrieval
publisher Ludwig-Maximilians-Universität München
publishDate 2023
url https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf
http://nbn-resolving.de/urn:nbn:de:bvb:19-324674
genre DML
genre_facet DML
op_relation https://edoc.ub.uni-muenchen.de/32467/
https://edoc.ub.uni-muenchen.de/32467/1/Ott_Felix.pdf
http://nbn-resolving.de/urn:nbn:de:bvb:19-324674
Ott, Felix (2023): Representation learning for domain adaptation and cross-modal retrieval: in the context of online handwriting recognition and visual self-localization. Dissertation, LMU München: Fakultät für Mathematik, Informatik und Statistik
_version_ 1781698046816419840