Feature Representation in Deep Metric Embeddings

In deep metric learning (DML), high-level input data are represented in a lower-level representation (embedding) space, such that samples from the same class are mapped close together, while samples from disparate classes are mapped further apart. In this lower-level representation, only a single in...

Full description

Bibliographic Details
Main Authors: Furlong, Ryan, O'Brien, Vincent, Garland, James, Palacios-Alonso, Daniel, Dominguez-Mateos, Francisco
Format: Text
Language:unknown
Published: 2021
Subjects:
DML
Online Access:http://arxiv.org/abs/2102.03176
id ftarxivpreprints:oai:arXiv.org:2102.03176
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:2102.03176 2023-09-05T13:19:06+02:00 Feature Representation in Deep Metric Embeddings Furlong, Ryan O'Brien, Vincent Garland, James Palacios-Alonso, Daniel Dominguez-Mateos, Francisco 2021-02-05 http://arxiv.org/abs/2102.03176 unknown http://arxiv.org/abs/2102.03176 Computer Science - Computer Vision and Pattern Recognition text 2021 ftarxivpreprints 2023-08-16T16:19:46Z In deep metric learning (DML), high-level input data are represented in a lower-level representation (embedding) space, such that samples from the same class are mapped close together, while samples from disparate classes are mapped further apart. In this lower-level representation, only a single inference sample from each known class is required to discriminate between classes accurately. The features a DML model uses to discriminate between classes and the importance of each feature in the training process are unknown. To investigate this, this study takes embeddings trained to discriminate faces (identities) and uses unsupervised clustering to identify the features involved in facial identity discrimination by examining their representation within the embedded space. This study is split into two cases; intra class sub-discrimination, where attributes that differ between a single identity are considered; such as beards and emotions; and extra class sub-discrimination, where attributes which differ between different identities/people, are considered; such as gender, skin tone and age. In the intra class scenario, the inference process distinguishes common attributes between single identities, achieving 90.0\% and 76.0\% accuracy for beards and glasses, respectively. The system can also perform extra class sub-discrimination with a high accuracy rate, notably 99.3\%, 99.3\% and 94.1\% for gender, skin tone, and age, respectively. Text DML ArXiv.org (Cornell University Library)
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Computer Vision and Pattern Recognition
spellingShingle Computer Science - Computer Vision and Pattern Recognition
Furlong, Ryan
O'Brien, Vincent
Garland, James
Palacios-Alonso, Daniel
Dominguez-Mateos, Francisco
Feature Representation in Deep Metric Embeddings
topic_facet Computer Science - Computer Vision and Pattern Recognition
description In deep metric learning (DML), high-level input data are represented in a lower-level representation (embedding) space, such that samples from the same class are mapped close together, while samples from disparate classes are mapped further apart. In this lower-level representation, only a single inference sample from each known class is required to discriminate between classes accurately. The features a DML model uses to discriminate between classes and the importance of each feature in the training process are unknown. To investigate this, this study takes embeddings trained to discriminate faces (identities) and uses unsupervised clustering to identify the features involved in facial identity discrimination by examining their representation within the embedded space. This study is split into two cases; intra class sub-discrimination, where attributes that differ between a single identity are considered; such as beards and emotions; and extra class sub-discrimination, where attributes which differ between different identities/people, are considered; such as gender, skin tone and age. In the intra class scenario, the inference process distinguishes common attributes between single identities, achieving 90.0\% and 76.0\% accuracy for beards and glasses, respectively. The system can also perform extra class sub-discrimination with a high accuracy rate, notably 99.3\%, 99.3\% and 94.1\% for gender, skin tone, and age, respectively.
format Text
author Furlong, Ryan
O'Brien, Vincent
Garland, James
Palacios-Alonso, Daniel
Dominguez-Mateos, Francisco
author_facet Furlong, Ryan
O'Brien, Vincent
Garland, James
Palacios-Alonso, Daniel
Dominguez-Mateos, Francisco
author_sort Furlong, Ryan
title Feature Representation in Deep Metric Embeddings
title_short Feature Representation in Deep Metric Embeddings
title_full Feature Representation in Deep Metric Embeddings
title_fullStr Feature Representation in Deep Metric Embeddings
title_full_unstemmed Feature Representation in Deep Metric Embeddings
title_sort feature representation in deep metric embeddings
publishDate 2021
url http://arxiv.org/abs/2102.03176
genre DML
genre_facet DML
op_relation http://arxiv.org/abs/2102.03176
_version_ 1776199909709447168