Construction of a French-LSF corpus

In this article, we present the first academic comparable corpus involving written French and French Sign Language. After explaining our initial motivation to build a parallel set of such data, especially in the context of our work on Sign Language modelling and our prospect o...

Bibliographic Details
Main Authors: Filhol, Michael; Tannier, Xavier
Other Authors: Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI), Université Paris-Sud - Paris 11 (UP11)-Sorbonne Université - UFR d'Ingénierie (UFR 919), Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Paris Saclay (COmUE)
Format: Conference Object
Language: English
Published: HAL CCSD 2014
Subjects:
Online Access: https://hal.science/hal-01848998
id ftsorbonneuniv:oai:HAL:hal-01848998v1
record_format openpolar
institution Open Polar
collection HAL Sorbonne Université
op_collection_id ftsorbonneuniv
language English
topic [INFO]Computer Science [cs]
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
spellingShingle [INFO]Computer Science [cs]
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Filhol, Michael
Tannier, Xavier
Construction of a French-LSF corpus
topic_facet [INFO]Computer Science [cs]
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
description In this article, we present the first academic comparable corpus involving written French and French Sign Language. After explaining our initial motivation to build a parallel set of such data, especially in the context of our work on Sign Language modelling and our prospect of machine translation into Sign Language, we present the main problems posed when mixing language channels and modalities (oral, written, signed), discussing the translation-vs-interpretation narrative in particular. We describe the process followed to guarantee feature coverage and exploitable results despite a serious cost limitation, the data being collected from professional translations. We conclude with a few uses and prospects of the corpus.
author2 Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI)
Université Paris-Sud - Paris 11 (UP11)-Sorbonne Université - UFR d'Ingénierie (UFR 919)
Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Paris Saclay (COmUE)
format Conference Object
author Filhol, Michael
Tannier, Xavier
author_facet Filhol, Michael
Tannier, Xavier
author_sort Filhol, Michael
title Construction of a French-LSF corpus
title_short Construction of a French-LSF corpus
title_full Construction of a French-LSF corpus
title_fullStr Construction of a French-LSF corpus
title_full_unstemmed Construction of a French-LSF corpus
title_sort construction of a french-lsf corpus
publisher HAL CCSD
publishDate 2014
url https://hal.science/hal-01848998
op_coverage Reykjavík, Iceland
genre Iceland
Reykjavík
Reykjavík
genre_facet Iceland
Reykjavík
Reykjavík
op_source Workshop on Building and Using Comparable Corpora
https://hal.science/hal-01848998
Workshop on Building and Using Comparable Corpora, May 2014, Reykjavík, Iceland
op_relation hal-01848998
https://hal.science/hal-01848998
_version_ 1810451446813949952