Mapping the Natural Language Processing Domain: Experiments using the ACL Anthology

International audience This paper investigates the evolution of the computational linguistics domain through a quantitative analysis of the ACL Anthology (containing around 12,000 papers published between 1985 and 2008). Our approach combines complex system methods with natural language processing t...

Full description

Bibliographic Details
Main Authors: Omodei, Elisa, Cointet, Jean-Philippe, Poibeau, Thierry
Other Authors: Lattice - Langues, Textes, Traitements informatiques, Cognition - UMR 8094 (Lattice), Université Sorbonne Nouvelle - Paris 3-Université Sorbonne Paris Cité (USPC)-Centre National de la Recherche Scientifique (CNRS)-Université Paris sciences et lettres (PSL)-Département Littératures et langage (LILA), École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL), Sciences en Société (SenS), Institut National de la Recherche Agronomique (INRA), United Nations Educational Scientific and Cultural Organization (UNESCO). INT.
Format: Conference Object
Language:English
Published: HAL CCSD 2014
Subjects:
Online Access:https://hal.archives-ouvertes.fr/hal-01056147
https://hal.archives-ouvertes.fr/hal-01056147/document
https://hal.archives-ouvertes.fr/hal-01056147/file/paper_lrec.pdf
id ftccsdartic:oai:HAL:hal-01056147v1
record_format openpolar
spelling ftccsdartic:oai:HAL:hal-01056147v1 2023-05-15T16:50:23+02:00 Mapping the Natural Language Processing Domain: Experiments using the ACL Anthology Omodei, Elisa Cointet, Jean-Philippe Poibeau, Thierry Lattice - Langues, Textes, Traitements informatiques, Cognition - UMR 8094 (Lattice) Université Sorbonne Nouvelle - Paris 3-Université Sorbonne Paris Cité (USPC)-Centre National de la Recherche Scientifique (CNRS)-Université Paris sciences et lettres (PSL)-Département Littératures et langage (LILA) École normale supérieure - Paris (ENS Paris) Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-École normale supérieure - Paris (ENS Paris) Université Paris sciences et lettres (PSL) Sciences en Société (SenS) Institut National de la Recherche Agronomique (INRA) United Nations Educational Scientific and Cultural Organization (UNESCO). INT. Reykjavik, Iceland 2014-05-26 https://hal.archives-ouvertes.fr/hal-01056147 https://hal.archives-ouvertes.fr/hal-01056147/document https://hal.archives-ouvertes.fr/hal-01056147/file/paper_lrec.pdf en eng HAL CCSD ELRA ISBN: 978-2-9517408-8-4 9782951740884 hal-01056147 https://hal.archives-ouvertes.fr/hal-01056147 https://hal.archives-ouvertes.fr/hal-01056147/document https://hal.archives-ouvertes.fr/hal-01056147/file/paper_lrec.pdf PRODINRA: 314112 WOS: 000355611004096 info:eu-repo/semantics/OpenAccess Proceedings of LREC 2014, the Ninth International Conference on Language Resources and Evaluation LREC 2014, the Ninth International Conference on Language Resources and Evaluation https://hal.archives-ouvertes.fr/hal-01056147 LREC 2014, the Ninth International Conference on Language Resources and Evaluation, United Nations Educational Scientific and Cultural Organization (UNESCO). INT., May 2014, Reykjavik, Iceland. pp.2972-2979 http://www.lrec-conf.org/proceedings/lrec2014/index.html ACL Anthology semantic network social network [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI] [SCCO.COMP]Cognitive science/Computer science [SCCO.LING]Cognitive science/Linguistics [SHS.LANGUE]Humanities and Social Sciences/Linguistics [SHS.INFO]Humanities and Social Sciences/Library and information sciences info:eu-repo/semantics/conferenceObject Conference papers 2014 ftccsdartic 2021-11-21T03:25:05Z International audience This paper investigates the evolution of the computational linguistics domain through a quantitative analysis of the ACL Anthology (containing around 12,000 papers published between 1985 and 2008). Our approach combines complex system methods with natural language processing techniques. We reconstruct the socio-semantic landscape of the domain by inferring a co-authorship and a semantic network from the analysis of the corpus. First, keywords are extracted using a hybrid approach mixing linguistic patterns with statistical information. Then, the semantic network is built using a co-occurrence analysis of these keywords within the corpus. Combining temporal and network analysis techniques, we are able to examine the main evolutions of the field and the more active subfields over time. Lastly we propose a model to explore the mutual influence of the social and the semantic network over time, leading to a socio-semantic co-evolutionary system. Conference Object Iceland Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)
institution Open Polar
collection Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)
op_collection_id ftccsdartic
language English
topic ACL Anthology
semantic network
social network
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
[SCCO.COMP]Cognitive science/Computer science
[SCCO.LING]Cognitive science/Linguistics
[SHS.LANGUE]Humanities and Social Sciences/Linguistics
[SHS.INFO]Humanities and Social Sciences/Library and information sciences
spellingShingle ACL Anthology
semantic network
social network
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
[SCCO.COMP]Cognitive science/Computer science
[SCCO.LING]Cognitive science/Linguistics
[SHS.LANGUE]Humanities and Social Sciences/Linguistics
[SHS.INFO]Humanities and Social Sciences/Library and information sciences
Omodei, Elisa
Cointet, Jean-Philippe
Poibeau, Thierry
Mapping the Natural Language Processing Domain: Experiments using the ACL Anthology
topic_facet ACL Anthology
semantic network
social network
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
[SCCO.COMP]Cognitive science/Computer science
[SCCO.LING]Cognitive science/Linguistics
[SHS.LANGUE]Humanities and Social Sciences/Linguistics
[SHS.INFO]Humanities and Social Sciences/Library and information sciences
description International audience This paper investigates the evolution of the computational linguistics domain through a quantitative analysis of the ACL Anthology (containing around 12,000 papers published between 1985 and 2008). Our approach combines complex system methods with natural language processing techniques. We reconstruct the socio-semantic landscape of the domain by inferring a co-authorship and a semantic network from the analysis of the corpus. First, keywords are extracted using a hybrid approach mixing linguistic patterns with statistical information. Then, the semantic network is built using a co-occurrence analysis of these keywords within the corpus. Combining temporal and network analysis techniques, we are able to examine the main evolutions of the field and the more active subfields over time. Lastly we propose a model to explore the mutual influence of the social and the semantic network over time, leading to a socio-semantic co-evolutionary system.
author2 Lattice - Langues, Textes, Traitements informatiques, Cognition - UMR 8094 (Lattice)
Université Sorbonne Nouvelle - Paris 3-Université Sorbonne Paris Cité (USPC)-Centre National de la Recherche Scientifique (CNRS)-Université Paris sciences et lettres (PSL)-Département Littératures et langage (LILA)
École normale supérieure - Paris (ENS Paris)
Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-École normale supérieure - Paris (ENS Paris)
Université Paris sciences et lettres (PSL)
Sciences en Société (SenS)
Institut National de la Recherche Agronomique (INRA)
United Nations Educational Scientific and Cultural Organization (UNESCO). INT.
format Conference Object
author Omodei, Elisa
Cointet, Jean-Philippe
Poibeau, Thierry
author_facet Omodei, Elisa
Cointet, Jean-Philippe
Poibeau, Thierry
author_sort Omodei, Elisa
title Mapping the Natural Language Processing Domain: Experiments using the ACL Anthology
title_short Mapping the Natural Language Processing Domain: Experiments using the ACL Anthology
title_full Mapping the Natural Language Processing Domain: Experiments using the ACL Anthology
title_fullStr Mapping the Natural Language Processing Domain: Experiments using the ACL Anthology
title_full_unstemmed Mapping the Natural Language Processing Domain: Experiments using the ACL Anthology
title_sort mapping the natural language processing domain: experiments using the acl anthology
publisher HAL CCSD
publishDate 2014
url https://hal.archives-ouvertes.fr/hal-01056147
https://hal.archives-ouvertes.fr/hal-01056147/document
https://hal.archives-ouvertes.fr/hal-01056147/file/paper_lrec.pdf
op_coverage Reykjavik, Iceland
genre Iceland
genre_facet Iceland
op_source Proceedings of LREC 2014, the Ninth International Conference on Language Resources and Evaluation
LREC 2014, the Ninth International Conference on Language Resources and Evaluation
https://hal.archives-ouvertes.fr/hal-01056147
LREC 2014, the Ninth International Conference on Language Resources and Evaluation, United Nations Educational Scientific and Cultural Organization (UNESCO). INT., May 2014, Reykjavik, Iceland. pp.2972-2979
http://www.lrec-conf.org/proceedings/lrec2014/index.html
op_relation ISBN: 978-2-9517408-8-4 9782951740884
hal-01056147
https://hal.archives-ouvertes.fr/hal-01056147
https://hal.archives-ouvertes.fr/hal-01056147/document
https://hal.archives-ouvertes.fr/hal-01056147/file/paper_lrec.pdf
PRODINRA: 314112
WOS: 000355611004096
op_rights info:eu-repo/semantics/OpenAccess
_version_ 1766040544972111872