Corpus ANCOR Centre

ANCOR Centre is a French spoken corpus annotated in coreference whose size (488,000 words) is sufficient to investigate the achievement of data oriented systems of coreference resolution. The annotation was conducted on three different corpora of conversational speech (Accueil_UBS, OTG, ESLO). It is...

Full description

Bibliographic Details
Main Author:	Antoine, Jean-Yves Pr, LI
Other Authors:	Boyer-Pelletier, Aurore Ms, LLL, Muzerelle, Judith Ms, LLL, Desoyer, Adèle Ms, LATTICE, Lefeuvre, Anaïs Dr, LI, Schang, Emmanuel Dr, LLL, Tellier, Isabelle Pr, LATTICE, Landragin, Frederic Dr, LATTICE, Eskhol, Iris Dr, LLL, Maurel, Denis Pr, LI, Villaneau, Jeanne Dr, IRISA, Laboratoire d'Informatique (LI, Tours FR), http://www.li.univ-tours.fr, Laboratoire Ligérien de Linguistique (LLL, Orléans FR), http://www.lll.cnrs.fr, Langues, textes, traitements informatiques, cognition - UMR 8094 (LaTTiCe, Paris FR), http://www.lattice.cnrs.fr
Format:	Dataset
Language:	French
Published:	Laboratoire d'Informatique (LI, Tours FR) 2014
Subjects:	computational linguistics text and corpus linguistics general linguistics coreference anaphora conversational speech coréférence anaphore parole spontanée Iceland
Online Access:	http://hal.archives-ouvertes.fr/hal-01075679 http://www.taln2013.org/actes/www/TALN-2013/actes/taln-2013-court-007.pdf https://hal.archives-ouvertes.fr/hal-01016562 http://hdl.handle.net/11041/ortolang-000903 http://sldr.org/logo/LogoOrtolang_small.png http://hdl.handle.net/11041/ortolang-000903?urlappend=/toc http://hdl.handle.net/11041/ortolang-000903/Pres_ANCOR_Centre.pdf http://hdl.handle.net/11041/ortolang-000903/CreativeCommons.html http://hdl.handle.net/11041/ortolang-000903/oai_dc.xml http://hdl.handle.net/11041/ortolang-000903/olac.xml http://hdl.handle.net/11041/ortolang-000903/rdf.html

Description
Summary:	ANCOR Centre is a French spoken corpus annotated in coreference whose size (488,000 words) is sufficient to investigate the achievement of data oriented systems of coreference resolution. The annotation was conducted on three different corpora of conversational speech (Accueil_UBS, OTG, ESLO). It is freely available under Creative Commons CC-BY-SA or CC-BY-SA-NC licence ANCOR Centre est un corpus francophone d'envergure (488 000 mots) de parole spontanée annoté en anaphores et coréférences portant aussi bien sur des entités nominales que pronominales. L'annotation a été réalisée sur trois corpus de parole conversationnelle (Accueil_UBS, OTG et ESLO) diffusés également librement. Le corpus ANCOR_Centre est distribué gratuitement sous licence Creative Commons CC-BY-SA pour ce qui est des données concernant les corpus OTG, Accueil_UBS et CO2, et sous licence CC-BY-SA-NC pour le corpus lié à ESLO. Les sources audio (diffusées librement par ailleurs) liées à ce corpus ne font pas l'objet de cette distribution. MUZERELLE, J.; LEFEUVRE, A.; SCHANG, E.; ANTOINE, J.-Y; PELLETIER, A.; MAUREL, D.; ESHKOL, I.; VILLANEAU, J. (2014). ANCOR_Centre, a Large Free Spoken French Coreference Corpus: description of the Resource and Reliability Measures. LREC'2014, 9th Language Resources and Evaluation Conference., May 2014, Reyjavik, Iceland. http://hal.archives-ouvertes.fr/hal-01075679 Judith MUZERELLE, Anaïs LEFEUVRE, Jean-Yves ANTOINE, Emmanuel SCHANG, Denis MAUREL, Jeanne VILLANEAU, Iris ESHKOL (2013). ANCOR : premier corpus de français parlé d'envergure annoté en coréférence et distribué librement. Actes TALN'2013. Les Sables d'Olonnes, France [HAL 01016562]. http://www.taln2013.org/actes/www/TALN-2013/actes/taln-2013-court-007.pdf https://hal.archives-ouvertes.fr/hal-01016562 VERSION HISTORY: 1.0 version avec annotation déportée des coréférences au format Glozz et pointage des relations de coréférence sur la première mention (LI & LLL) 1.1 version avec ajout d'une version intégrée des annotations en chaînes de coréférence Work in progress: - réalisation d'une version compatible TEI - réalisation d'une version avec annotation déportée Glozz en chaînes de coréférences - réalisation d'une version avec annotation déportée en cluster de mentions coréférentes

Corpus ANCOR Centre

Similar Items