Semi-compositional Method for Synonym Extraction of Multi-Word Terms
International audience Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc. While different studies addressed the task of word synonym extraction, only a few inv...
Main Authors: | , |
---|---|
Other Authors: | , , , , , , , , , , |
Format: | Conference Object |
Language: | English |
Published: |
HAL CCSD
2014
|
Subjects: | |
Online Access: | https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf |
id |
ftccsdartic:oai:HAL:hal-01171093v1 |
---|---|
record_format |
openpolar |
spelling |
ftccsdartic:oai:HAL:hal-01171093v1 2024-02-04T10:01:28+01:00 Semi-compositional Method for Synonym Extraction of Multi-Word Terms Hazem, Amir Daille, Béatrice Laboratoire d'Informatique de Nantes Atlantique (LINA) Mines Nantes (Mines Nantes)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-Centre National de la Recherche Scientifique (CNRS) Traitement Automatique du Langage Naturel (LS2N - équipe TALN ) Laboratoire des Sciences du Numérique de Nantes (LS2N) Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT) ELRA ANR-12-CORD-0029,TermITH,TERMinologie et Indexation de Textes en sciences Humaines(2012) Reykjavik, Iceland 2014-05-26 https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf en eng HAL CCSD hal-01171093 https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf info:eu-repo/semantics/OpenAccess 9th edition of the Language Resources and Evaluation Conference (LREC 2014) https://hal.science/hal-01171093 9th edition of the Language Resources and Evaluation Conference (LREC 2014), ELRA, May 2014, Reykjavik, Iceland Synonyms Multi-word terms Compositionality Distributional semantics Unsupervised methods [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] info:eu-repo/semantics/conferenceObject Conference papers 2014 ftccsdartic 2024-01-06T23:42:11Z International audience Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc. While different studies addressed the task of word synonym extraction, only a few investigations tackled the problem of acquiring synonyms of multi-word terms (MWT) from specialized corpora. To extract pairs of synonyms of multi-word terms, we propose in this paper an unsupervised semi-compositional method that makes use of distributional semantics and exploit the compositional property shared by most MWT. We show that our method outperforms significantly the state-of-the-art. Conference Object Iceland Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) |
institution |
Open Polar |
collection |
Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) |
op_collection_id |
ftccsdartic |
language |
English |
topic |
Synonyms Multi-word terms Compositionality Distributional semantics Unsupervised methods [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] |
spellingShingle |
Synonyms Multi-word terms Compositionality Distributional semantics Unsupervised methods [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] Hazem, Amir Daille, Béatrice Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
topic_facet |
Synonyms Multi-word terms Compositionality Distributional semantics Unsupervised methods [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] |
description |
International audience Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc. While different studies addressed the task of word synonym extraction, only a few investigations tackled the problem of acquiring synonyms of multi-word terms (MWT) from specialized corpora. To extract pairs of synonyms of multi-word terms, we propose in this paper an unsupervised semi-compositional method that makes use of distributional semantics and exploit the compositional property shared by most MWT. We show that our method outperforms significantly the state-of-the-art. |
author2 |
Laboratoire d'Informatique de Nantes Atlantique (LINA) Mines Nantes (Mines Nantes)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-Centre National de la Recherche Scientifique (CNRS) Traitement Automatique du Langage Naturel (LS2N - équipe TALN ) Laboratoire des Sciences du Numérique de Nantes (LS2N) Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT) ELRA ANR-12-CORD-0029,TermITH,TERMinologie et Indexation de Textes en sciences Humaines(2012) |
format |
Conference Object |
author |
Hazem, Amir Daille, Béatrice |
author_facet |
Hazem, Amir Daille, Béatrice |
author_sort |
Hazem, Amir |
title |
Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_short |
Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_full |
Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_fullStr |
Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_full_unstemmed |
Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_sort |
semi-compositional method for synonym extraction of multi-word terms |
publisher |
HAL CCSD |
publishDate |
2014 |
url |
https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf |
op_coverage |
Reykjavik, Iceland |
genre |
Iceland |
genre_facet |
Iceland |
op_source |
9th edition of the Language Resources and Evaluation Conference (LREC 2014) https://hal.science/hal-01171093 9th edition of the Language Resources and Evaluation Conference (LREC 2014), ELRA, May 2014, Reykjavik, Iceland |
op_relation |
hal-01171093 https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf |
op_rights |
info:eu-repo/semantics/OpenAccess |
_version_ |
1789967385244991488 |