GLÀFF, a Large Versatile French Lexicon

International audience This paper introduces GLAFF, a large-scale versatile French lexicon extracted from Wiktionary, the collaborative online dictionary. GLAFF contains, for each entry, inflectional features and phonemic transcriptions. It distinguishes itself from the other available French lexico...

Full description

Bibliographic Details
Main Authors: Hathout, Nabil, Sajous, Franck, Calderone, Basilio
Other Authors: Cognition, Langues, Langage, Ergonomie (CLLE-ERSS), École pratique des hautes études (EPHE), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Université Toulouse - Jean Jaurès (UT2J)-Université Bordeaux Montaigne-Centre National de la Recherche Scientifique (CNRS)
Format: Conference Object
Language:English
Published: HAL CCSD 2014
Subjects:
Online Access:https://hal.science/hal-00998467
https://hal.science/hal-00998467/document
https://hal.science/hal-00998467/file/HathoutEtAl_LREC2014_GLAFF.pdf
id ftunivnantes:oai:HAL:hal-00998467v1
record_format openpolar
spelling ftunivnantes:oai:HAL:hal-00998467v1 2023-05-15T16:49:00+02:00 GLÀFF, a Large Versatile French Lexicon Hathout, Nabil Sajous, Franck Calderone, Basilio Cognition, Langues, Langage, Ergonomie (CLLE-ERSS) École pratique des hautes études (EPHE) Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Université Toulouse - Jean Jaurès (UT2J)-Université Bordeaux Montaigne-Centre National de la Recherche Scientifique (CNRS) Reykjavik, Iceland 2014-05-26 https://hal.science/hal-00998467 https://hal.science/hal-00998467/document https://hal.science/hal-00998467/file/HathoutEtAl_LREC2014_GLAFF.pdf en eng HAL CCSD hal-00998467 https://hal.science/hal-00998467 https://hal.science/hal-00998467/document https://hal.science/hal-00998467/file/HathoutEtAl_LREC2014_GLAFF.pdf info:eu-repo/semantics/OpenAccess Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) Conference on Language Resources and Evaluation (LREC) https://hal.science/hal-00998467 Conference on Language Resources and Evaluation (LREC), May 2014, Reykjavik, Iceland. pp.1007-1012 Phonetic Databases Phonology Morphology [SHS.LANGUE]Humanities and Social Sciences/Linguistics info:eu-repo/semantics/conferenceObject Conference papers 2014 ftunivnantes 2023-01-31T23:40:42Z International audience This paper introduces GLAFF, a large-scale versatile French lexicon extracted from Wiktionary, the collaborative online dictionary. GLAFF contains, for each entry, inflectional features and phonemic transcriptions. It distinguishes itself from the other available French lexicons by its size, its potential for constant updating and its copylefted license. We explain how we have built GLAFF and compare it to other known resources in terms of coverage and quality of the phonemic transcriptions. We show that its size and quality are strong assets that could allow GLAFF to become a reference lexicon for French NLP and linguistics. Moreover, other derived lexicons can easily be based on GLAFF to satisfy specific needs of various fields such as psycholinguistics. Conference Object Iceland Université de Nantes: HAL-UNIV-NANTES
institution Open Polar
collection Université de Nantes: HAL-UNIV-NANTES
op_collection_id ftunivnantes
language English
topic Phonetic Databases
Phonology
Morphology
[SHS.LANGUE]Humanities and Social Sciences/Linguistics
spellingShingle Phonetic Databases
Phonology
Morphology
[SHS.LANGUE]Humanities and Social Sciences/Linguistics
Hathout, Nabil
Sajous, Franck
Calderone, Basilio
GLÀFF, a Large Versatile French Lexicon
topic_facet Phonetic Databases
Phonology
Morphology
[SHS.LANGUE]Humanities and Social Sciences/Linguistics
description International audience This paper introduces GLAFF, a large-scale versatile French lexicon extracted from Wiktionary, the collaborative online dictionary. GLAFF contains, for each entry, inflectional features and phonemic transcriptions. It distinguishes itself from the other available French lexicons by its size, its potential for constant updating and its copylefted license. We explain how we have built GLAFF and compare it to other known resources in terms of coverage and quality of the phonemic transcriptions. We show that its size and quality are strong assets that could allow GLAFF to become a reference lexicon for French NLP and linguistics. Moreover, other derived lexicons can easily be based on GLAFF to satisfy specific needs of various fields such as psycholinguistics.
author2 Cognition, Langues, Langage, Ergonomie (CLLE-ERSS)
École pratique des hautes études (EPHE)
Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Université Toulouse - Jean Jaurès (UT2J)-Université Bordeaux Montaigne-Centre National de la Recherche Scientifique (CNRS)
format Conference Object
author Hathout, Nabil
Sajous, Franck
Calderone, Basilio
author_facet Hathout, Nabil
Sajous, Franck
Calderone, Basilio
author_sort Hathout, Nabil
title GLÀFF, a Large Versatile French Lexicon
title_short GLÀFF, a Large Versatile French Lexicon
title_full GLÀFF, a Large Versatile French Lexicon
title_fullStr GLÀFF, a Large Versatile French Lexicon
title_full_unstemmed GLÀFF, a Large Versatile French Lexicon
title_sort glàff, a large versatile french lexicon
publisher HAL CCSD
publishDate 2014
url https://hal.science/hal-00998467
https://hal.science/hal-00998467/document
https://hal.science/hal-00998467/file/HathoutEtAl_LREC2014_GLAFF.pdf
op_coverage Reykjavik, Iceland
genre Iceland
genre_facet Iceland
op_source Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Conference on Language Resources and Evaluation (LREC)
https://hal.science/hal-00998467
Conference on Language Resources and Evaluation (LREC), May 2014, Reykjavik, Iceland. pp.1007-1012
op_relation hal-00998467
https://hal.science/hal-00998467
https://hal.science/hal-00998467/document
https://hal.science/hal-00998467/file/HathoutEtAl_LREC2014_GLAFF.pdf
op_rights info:eu-repo/semantics/OpenAccess
_version_ 1766039065866534912