TVD: a reproducible and multiply aligned TV series dataset

International audience We introduce a new dataset built around two TV series from different genres, The Big Bang Theory, a situation comedy and Game of Thrones, a fantasy drama. The dataset has multiple tracks extracted from diverse sources, including dialogue (manual and automatic transcripts, mult...

Full description

Bibliographic Details
Main Authors:	Roy, Anindya, Guinaudeau, Camille, Bredin, Hervé, Barras, Claude
Other Authors:	Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI), Université Paris-Sud - Paris 11 (UP11)-Sorbonne Université - UFR d'Ingénierie (UFR 919), Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Paris Saclay (COmUE), ANR-10-CORD-0103,QCOMPERE,Consortium Quaero pour la Reconnaissane Multimodale des Personnes(2010), ANR-12-CHRI-0006,CAMOMILE,Annotation collaborative de documents multi-modaux, multi-lingues et multi-média(2012)
Format:	Conference Object
Language:	English
Published:	HAL CCSD 2014
Subjects:	TV Series transcripts episode outlines [INFO]Computer Science [cs] [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM] Iceland
Online Access:	https://hal.science/hal-01690279 https://hal.science/hal-01690279/document https://hal.science/hal-01690279/file/751_Paper.pdf

id	ftsorbonneuniv:oai:HAL:hal-01690279v1
record_format	openpolar
spelling	ftsorbonneuniv:oai:HAL:hal-01690279v1 2023-11-05T03:42:56+01:00 TVD: a reproducible and multiply aligned TV series dataset Roy, Anindya Guinaudeau, Camille Bredin, Hervé Barras, Claude Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI) Université Paris-Sud - Paris 11 (UP11)-Sorbonne Université - UFR d'Ingénierie (UFR 919) Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Paris Saclay (COmUE) ANR-10-CORD-0103,QCOMPERE,Consortium Quaero pour la Reconnaissane Multimodale des Personnes(2010) ANR-12-CHRI-0006,CAMOMILE,Annotation collaborative de documents multi-modaux, multi-lingues et multi-média(2012) Reykjavik, Iceland 2014-05 https://hal.science/hal-01690279 https://hal.science/hal-01690279/document https://hal.science/hal-01690279/file/751_Paper.pdf en eng HAL CCSD hal-01690279 https://hal.science/hal-01690279 https://hal.science/hal-01690279/document https://hal.science/hal-01690279/file/751_Paper.pdf info:eu-repo/semantics/OpenAccess LREC 2014 https://hal.science/hal-01690279 LREC 2014, May 2014, Reykjavik, Iceland TV Series transcripts episode outlines [INFO]Computer Science [cs] [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM] info:eu-repo/semantics/conferenceObject Conference papers 2014 ftsorbonneuniv 2023-10-11T00:01:10Z International audience We introduce a new dataset built around two TV series from different genres, The Big Bang Theory, a situation comedy and Game of Thrones, a fantasy drama. The dataset has multiple tracks extracted from diverse sources, including dialogue (manual and automatic transcripts, multilingual subtitles), crowd-sourced textual descriptions (brief episode summaries, longer episode outlines) and various metadata (speakers, shots, scenes). The paper describes the dataset and provide tools to reproduce it for research purposes provided one has legally acquired the DVD set of the series. Tools are also provided to temporally align a major subset of dialogue and description tracks, in order to combine complementary information present in these tracks for enhanced accessibility. For alignment, we consider tracks as comparable corpora and first apply an existing algorithm for aligning such corpora based on dynamic time warping and TFIDF-based similarity scores. We improve this baseline algorithm using contextual information, WordNet-based word similarity and scene location information. We report the performance of these algorithms on a manually aligned subset of the data. To highlight the interest of the database, we report a use case involving rich speech retrieval and propose other uses. Conference Object Iceland HAL Sorbonne Université
institution	Open Polar
collection	HAL Sorbonne Université
op_collection_id	ftsorbonneuniv
language	English
topic	TV Series transcripts episode outlines [INFO]Computer Science [cs] [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]
spellingShingle	TV Series transcripts episode outlines [INFO]Computer Science [cs] [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM] Roy, Anindya Guinaudeau, Camille Bredin, Hervé Barras, Claude TVD: a reproducible and multiply aligned TV series dataset
topic_facet	TV Series transcripts episode outlines [INFO]Computer Science [cs] [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]
description	International audience We introduce a new dataset built around two TV series from different genres, The Big Bang Theory, a situation comedy and Game of Thrones, a fantasy drama. The dataset has multiple tracks extracted from diverse sources, including dialogue (manual and automatic transcripts, multilingual subtitles), crowd-sourced textual descriptions (brief episode summaries, longer episode outlines) and various metadata (speakers, shots, scenes). The paper describes the dataset and provide tools to reproduce it for research purposes provided one has legally acquired the DVD set of the series. Tools are also provided to temporally align a major subset of dialogue and description tracks, in order to combine complementary information present in these tracks for enhanced accessibility. For alignment, we consider tracks as comparable corpora and first apply an existing algorithm for aligning such corpora based on dynamic time warping and TFIDF-based similarity scores. We improve this baseline algorithm using contextual information, WordNet-based word similarity and scene location information. We report the performance of these algorithms on a manually aligned subset of the data. To highlight the interest of the database, we report a use case involving rich speech retrieval and propose other uses.
author2	Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI) Université Paris-Sud - Paris 11 (UP11)-Sorbonne Université - UFR d'Ingénierie (UFR 919) Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Paris Saclay (COmUE) ANR-10-CORD-0103,QCOMPERE,Consortium Quaero pour la Reconnaissane Multimodale des Personnes(2010) ANR-12-CHRI-0006,CAMOMILE,Annotation collaborative de documents multi-modaux, multi-lingues et multi-média(2012)
format	Conference Object
author	Roy, Anindya Guinaudeau, Camille Bredin, Hervé Barras, Claude
author_facet	Roy, Anindya Guinaudeau, Camille Bredin, Hervé Barras, Claude
author_sort	Roy, Anindya
title	TVD: a reproducible and multiply aligned TV series dataset
title_short	TVD: a reproducible and multiply aligned TV series dataset
title_full	TVD: a reproducible and multiply aligned TV series dataset
title_fullStr	TVD: a reproducible and multiply aligned TV series dataset
title_full_unstemmed	TVD: a reproducible and multiply aligned TV series dataset
title_sort	tvd: a reproducible and multiply aligned tv series dataset
publisher	HAL CCSD
publishDate	2014
url	https://hal.science/hal-01690279 https://hal.science/hal-01690279/document https://hal.science/hal-01690279/file/751_Paper.pdf
op_coverage	Reykjavik, Iceland
genre	Iceland
genre_facet	Iceland
op_source	LREC 2014 https://hal.science/hal-01690279 LREC 2014, May 2014, Reykjavik, Iceland
op_relation	hal-01690279 https://hal.science/hal-01690279 https://hal.science/hal-01690279/document https://hal.science/hal-01690279/file/751_Paper.pdf
op_rights	info:eu-repo/semantics/OpenAccess
_version_	1781700573352951808

TVD: a reproducible and multiply aligned TV series dataset

Similar Items