Finding Near-Duplicate Videos in Large-Scale Collections

This chapter discusses the problem of Near-Duplicate Video Retrieval (NDVR). The main objective of a typical NDVR approach is: given a query video, retrieve all near-duplicate videos in a video repository and rank them based on their similarity to the query. Several approaches have been introduced i...

Full description

Bibliographic Details
Main Authors: Kordopatis-Zilos, G, Papadopoulos, S, Patras, I, Kompatsiaris, I
Other Authors: Mezaris, V, Nixon, L, Teyssou, D
Format: Book Part
Language:unknown
Published: Springer 2019
Subjects:
DML
Online Access:https://qmro.qmul.ac.uk/xmlui/handle/123456789/61983
https://doi.org/10.1007/978-3-030-26752-0_4
Description
Summary:This chapter discusses the problem of Near-Duplicate Video Retrieval (NDVR). The main objective of a typical NDVR approach is: given a query video, retrieve all near-duplicate videos in a video repository and rank them based on their similarity to the query. Several approaches have been introduced in the literature, which can be roughly classified in three categories based on the level of video matching, i.e., (i) video-level, (ii) frame-level, and (iii) filter-and-refine matching. Two methods based on video-level matching are presented in this chapter. The first is an unsupervised scheme that relies on a modified Bag-of-Words (BoW) video representation. The second is a s upervised method based on Deep Metric Learning (DML). For the development of both methods, features are extracted from the intermediate layers of Convolutional Neural Networks and leveraged as frame descriptors, since they offer a compact and informative image representation, and lead to increased system efficiency. Extensive evaluation has been conducted on publicly available benchmark datasets, and the presented methods are compared with state-of-the-art approaches, achieving the best results in all evaluation setups.