Near-Duplicate Video Retrieval With Deep Metric Learning

This work addresses the problem of Near-Duplicate Video Retrieval (NDVR). We propose an effective video-level NDVR scheme based on deep metric learning that leverages Convolutional Neural Network (CNN) features from intermediate layers to generate discriminative global video representations in tande...

Full description

Bibliographic Details
Main Authors: Kordopatis-Zilos, Giorgos, Papadopoulos, Symeon, Patras, Ioannis, Kompatsiaris, Yiannis
Format: Conference Object
Language:unknown
Published: Zenodo 2017
Subjects:
DML
Online Access:https://dx.doi.org/10.5281/zenodo.852351
https://zenodo.org/record/852351
id ftdatacite:10.5281/zenodo.852351
record_format openpolar
spelling ftdatacite:10.5281/zenodo.852351 2023-05-15T16:01:55+02:00 Near-Duplicate Video Retrieval With Deep Metric Learning Kordopatis-Zilos, Giorgos Papadopoulos, Symeon Patras, Ioannis Kompatsiaris, Yiannis 2017 https://dx.doi.org/10.5281/zenodo.852351 https://zenodo.org/record/852351 unknown Zenodo https://dx.doi.org/10.5281/zenodo.852350 Open Access Creative Commons Attribution 4.0 https://creativecommons.org/licenses/by/4.0 info:eu-repo/semantics/openAccess CC-BY Near-Duplicate Video Retrieval Deep Metric Learning Text Conference paper article-journal ScholarlyArticle 2017 ftdatacite https://doi.org/10.5281/zenodo.852351 https://doi.org/10.5281/zenodo.852350 2021-11-05T12:55:41Z This work addresses the problem of Near-Duplicate Video Retrieval (NDVR). We propose an effective video-level NDVR scheme based on deep metric learning that leverages Convolutional Neural Network (CNN) features from intermediate layers to generate discriminative global video representations in tandem with a Deep Metric Learning (DML) framework with two fusion variations, trained to approximate an embedding function for accurate distance calculation between two near-duplicate videos. In contrast to most state-of-the-art methods, which exploit information deriving from the same source of data for both development and evaluation (which usually results to dataset-specific solutions), the proposed model is fed during training with sampled triplets generated from an independent dataset and is thoroughly tested on the widely used CC WEB VIDEO dataset, using two popular deep CNN architectures (AlexNet, GoogleNet). We demonstrate that the proposed approach achieves outstanding performance against the state-of-the-art, either with or without access to the evaluation dataset. Conference Object DML DataCite Metadata Store (German National Library of Science and Technology) Triplets ENVELOPE(-59.750,-59.750,-62.383,-62.383)
institution Open Polar
collection DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id ftdatacite
language unknown
topic Near-Duplicate
Video Retrieval
Deep Metric Learning
spellingShingle Near-Duplicate
Video Retrieval
Deep Metric Learning
Kordopatis-Zilos, Giorgos
Papadopoulos, Symeon
Patras, Ioannis
Kompatsiaris, Yiannis
Near-Duplicate Video Retrieval With Deep Metric Learning
topic_facet Near-Duplicate
Video Retrieval
Deep Metric Learning
description This work addresses the problem of Near-Duplicate Video Retrieval (NDVR). We propose an effective video-level NDVR scheme based on deep metric learning that leverages Convolutional Neural Network (CNN) features from intermediate layers to generate discriminative global video representations in tandem with a Deep Metric Learning (DML) framework with two fusion variations, trained to approximate an embedding function for accurate distance calculation between two near-duplicate videos. In contrast to most state-of-the-art methods, which exploit information deriving from the same source of data for both development and evaluation (which usually results to dataset-specific solutions), the proposed model is fed during training with sampled triplets generated from an independent dataset and is thoroughly tested on the widely used CC WEB VIDEO dataset, using two popular deep CNN architectures (AlexNet, GoogleNet). We demonstrate that the proposed approach achieves outstanding performance against the state-of-the-art, either with or without access to the evaluation dataset.
format Conference Object
author Kordopatis-Zilos, Giorgos
Papadopoulos, Symeon
Patras, Ioannis
Kompatsiaris, Yiannis
author_facet Kordopatis-Zilos, Giorgos
Papadopoulos, Symeon
Patras, Ioannis
Kompatsiaris, Yiannis
author_sort Kordopatis-Zilos, Giorgos
title Near-Duplicate Video Retrieval With Deep Metric Learning
title_short Near-Duplicate Video Retrieval With Deep Metric Learning
title_full Near-Duplicate Video Retrieval With Deep Metric Learning
title_fullStr Near-Duplicate Video Retrieval With Deep Metric Learning
title_full_unstemmed Near-Duplicate Video Retrieval With Deep Metric Learning
title_sort near-duplicate video retrieval with deep metric learning
publisher Zenodo
publishDate 2017
url https://dx.doi.org/10.5281/zenodo.852351
https://zenodo.org/record/852351
long_lat ENVELOPE(-59.750,-59.750,-62.383,-62.383)
geographic Triplets
geographic_facet Triplets
genre DML
genre_facet DML
op_relation https://dx.doi.org/10.5281/zenodo.852350
op_rights Open Access
Creative Commons Attribution 4.0
https://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
op_rightsnorm CC-BY
op_doi https://doi.org/10.5281/zenodo.852351
https://doi.org/10.5281/zenodo.852350
_version_ 1766397592133959680