Near-Duplicate Video Retrieval with Deep Metric Learning

This work addresses the problem of Near-Duplicate Video Retrieval (NDVR). We propose an effective video-level NDVR scheme based on deep metric learning that leverages Convolutional Neural Network (CNN) features from intermediate layers to generate discriminative global video representations in tande...

Full description

Bibliographic Details
Published in:	2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
Main Authors:	Kordopatis-Zilos, Giorgos, Papadopoulos, Symeon, Patras, Ioannis, Kompatsiaris, Yiannis
Format:	Conference Object
Language:	unknown
Published:	2017
Subjects:	Near-Duplicate Video Retrieval Deep Metric Learning Triplets DML
Online Access:	https://zenodo.org/record/1051270 https://doi.org/10.1109/ICCVW.2017.49

id	ftzenodo:oai:zenodo.org:1051270
record_format	openpolar
spelling	ftzenodo:oai:zenodo.org:1051270 2023-05-15T16:01:55+02:00 Near-Duplicate Video Retrieval with Deep Metric Learning Kordopatis-Zilos, Giorgos Papadopoulos, Symeon Patras, Ioannis Kompatsiaris, Yiannis 2017-10-23 https://zenodo.org/record/1051270 https://doi.org/10.1109/ICCVW.2017.49 unknown info:eu-repo/grantAgreement/EC/H2020/687786/ https://zenodo.org/communities/invid-h2020 https://zenodo.org/record/1051270 https://doi.org/10.1109/ICCVW.2017.49 oai:zenodo.org:1051270 info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by/4.0/legalcode Near-Duplicate Video Retrieval Deep Metric Learning info:eu-repo/semantics/conferencePaper publication-conferencepaper 2017 ftzenodo https://doi.org/10.1109/ICCVW.2017.49 2023-03-11T01:58:05Z This work addresses the problem of Near-Duplicate Video Retrieval (NDVR). We propose an effective video-level NDVR scheme based on deep metric learning that leverages Convolutional Neural Network (CNN) features from intermediate layers to generate discriminative global video representations in tandem with a Deep Metric Learning (DML) framework with two fusion variations, trained to approximate an embedding function for accurate distance calculation between two near-duplicate videos. In contrast to most state-of-the-art methods, which exploit information deriving from the same source of data for both development and evaluation (which usually results to dataset-specific solutions), the proposed model is fed during training with sampled triplets generated from an independent dataset and is thoroughly tested on the widely used CC_WEB_VIDEO dataset, using two popular deep CNN architectures (AlexNet, GoogleNet). We demonstrate that the proposed approach achieves outstanding performance against the state-of-the-art, either with or without access to the evaluation dataset. Conference Object DML Zenodo Triplets ENVELOPE(-59.750,-59.750,-62.383,-62.383) 2017 IEEE International Conference on Computer Vision Workshops (ICCVW) 347 356
institution	Open Polar
collection	Zenodo
op_collection_id	ftzenodo
language	unknown
topic	Near-Duplicate Video Retrieval Deep Metric Learning
spellingShingle	Near-Duplicate Video Retrieval Deep Metric Learning Kordopatis-Zilos, Giorgos Papadopoulos, Symeon Patras, Ioannis Kompatsiaris, Yiannis Near-Duplicate Video Retrieval with Deep Metric Learning
topic_facet	Near-Duplicate Video Retrieval Deep Metric Learning
description	This work addresses the problem of Near-Duplicate Video Retrieval (NDVR). We propose an effective video-level NDVR scheme based on deep metric learning that leverages Convolutional Neural Network (CNN) features from intermediate layers to generate discriminative global video representations in tandem with a Deep Metric Learning (DML) framework with two fusion variations, trained to approximate an embedding function for accurate distance calculation between two near-duplicate videos. In contrast to most state-of-the-art methods, which exploit information deriving from the same source of data for both development and evaluation (which usually results to dataset-specific solutions), the proposed model is fed during training with sampled triplets generated from an independent dataset and is thoroughly tested on the widely used CC_WEB_VIDEO dataset, using two popular deep CNN architectures (AlexNet, GoogleNet). We demonstrate that the proposed approach achieves outstanding performance against the state-of-the-art, either with or without access to the evaluation dataset.
format	Conference Object
author	Kordopatis-Zilos, Giorgos Papadopoulos, Symeon Patras, Ioannis Kompatsiaris, Yiannis
author_facet	Kordopatis-Zilos, Giorgos Papadopoulos, Symeon Patras, Ioannis Kompatsiaris, Yiannis
author_sort	Kordopatis-Zilos, Giorgos
title	Near-Duplicate Video Retrieval with Deep Metric Learning
title_short	Near-Duplicate Video Retrieval with Deep Metric Learning
title_full	Near-Duplicate Video Retrieval with Deep Metric Learning
title_fullStr	Near-Duplicate Video Retrieval with Deep Metric Learning
title_full_unstemmed	Near-Duplicate Video Retrieval with Deep Metric Learning
title_sort	near-duplicate video retrieval with deep metric learning
publishDate	2017
url	https://zenodo.org/record/1051270 https://doi.org/10.1109/ICCVW.2017.49
long_lat	ENVELOPE(-59.750,-59.750,-62.383,-62.383)
geographic	Triplets
geographic_facet	Triplets
genre	DML
genre_facet	DML
op_relation	info:eu-repo/grantAgreement/EC/H2020/687786/ https://zenodo.org/communities/invid-h2020 https://zenodo.org/record/1051270 https://doi.org/10.1109/ICCVW.2017.49 oai:zenodo.org:1051270
op_rights	info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by/4.0/legalcode
op_doi	https://doi.org/10.1109/ICCVW.2017.49
container_title	2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
container_start_page	347
op_container_end_page	356
_version_	1766397595266056192

Near-Duplicate Video Retrieval with Deep Metric Learning

Similar Items