Construction of a French-LSF corpus
International audience. In this article, we present the first academic comparable corpus involving written French and French Sign Language. After explaining our initial motivation to build a parallel set of such data, especially in the context of our work on Sign Language modelling and our prospect o...
Main Authors: | Filhol, Michael, Tannier, Xavier |
---|---|
Other Authors: | , , |
Format: | Conference Object |
Language: | English |
Published: | HAL CCSD 2014 |
Subjects: | [INFO]Computer Science [cs], [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] |
Online Access: | https://hal.archives-ouvertes.fr/hal-01848998 |
id |
ftccsdartic:oai:HAL:hal-01848998v1 |
---|---|
record_format |
openpolar |
spelling |
ftccsdartic:oai:HAL:hal-01848998v1 2023-05-15T16:48:55+02:00 Construction of a French-LSF corpus Filhol, Michael Tannier, Xavier Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI) Université Paris Saclay (COmUE)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université - UFR d'Ingénierie (UFR 919) Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Université Paris-Sud - Paris 11 (UP11) Reykjavík, Iceland 2014-05-01 https://hal.archives-ouvertes.fr/hal-01848998 en eng HAL CCSD hal-01848998 https://hal.archives-ouvertes.fr/hal-01848998 Workshop on Building and Using Comparable Corpora https://hal.archives-ouvertes.fr/hal-01848998 Workshop on Building and Using Comparable Corpora, May 2014, Reykjavík, Iceland [INFO]Computer Science [cs] [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] info:eu-repo/semantics/conferenceObject Conference papers 2014 ftccsdartic 2021-12-19T02:16:24Z International audience In this article, we present the first academic comparable corpus involving written French and French Sign Language. After explaining our initial motivation to build a parallel set of such data, especially in the context of our work on Sign Language modelling and our prospect of machine translation into Sign Language, we present the main problems posed when mixing language channels and modalities (oral, written, signed), discussing the translation-vs-interpretation narrative in particular. We describe the process followed to guarantee feature coverage and exploitable results despite a serious cost limitation, the data being collected from professional translations. We conclude with a few uses and prospects of the corpus. Conference Object Iceland Reykjavík Reykjavík Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) Reykjavík |
institution |
Open Polar |
collection |
Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) |
op_collection_id |
ftccsdartic |
language |
English |
topic |
[INFO]Computer Science [cs] [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] |
description |
International audience. In this article, we present the first academic comparable corpus involving written French and French Sign Language. After explaining our initial motivation to build a parallel set of such data, especially in the context of our work on Sign Language modelling and our prospect of machine translation into Sign Language, we present the main problems posed when mixing language channels and modalities (oral, written, signed), discussing the translation-vs-interpretation narrative in particular. We describe the process followed to guarantee feature coverage and exploitable results despite a serious cost limitation, the data being collected from professional translations. We conclude with a few uses and prospects of the corpus. |
author2 |
Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI) Université Paris Saclay (COmUE)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université - UFR d'Ingénierie (UFR 919) Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Université Paris-Sud - Paris 11 (UP11) |
format |
Conference Object |
author |
Filhol, Michael Tannier, Xavier |
author_sort |
Filhol, Michael |
title |
Construction of a French-LSF corpus |
publisher |
HAL CCSD |
publishDate |
2014 |
url |
https://hal.archives-ouvertes.fr/hal-01848998 |
op_coverage |
Reykjavík, Iceland |
geographic |
Reykjavík |
genre |
Iceland Reykjavík Reykjavík |
op_source |
Workshop on Building and Using Comparable Corpora https://hal.archives-ouvertes.fr/hal-01848998 Workshop on Building and Using Comparable Corpora, May 2014, Reykjavík, Iceland |
op_relation |
hal-01848998 https://hal.archives-ouvertes.fr/hal-01848998 |
_version_ |
1766039002489552896 |