Semi-compositional Method for Synonym Extraction of Multi-Word Terms
International audience Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc. While different studies addressed the task of word synonym extraction, only a few inv...
Main Authors: | , |
---|---|
Other Authors: | , , , , , , , , , , |
Format: | Conference Object |
Language: | English |
Published: |
HAL CCSD
2014
|
Subjects: | |
Online Access: | https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf |
_version_ | 1821552823904501760 |
---|---|
author | Hazem, Amir Daille, Béatrice |
author2 | Laboratoire d'Informatique de Nantes Atlantique (LINA) Mines Nantes (Mines Nantes)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-Centre National de la Recherche Scientifique (CNRS) Traitement Automatique du Langage Naturel (LS2N - équipe TALN ) Laboratoire des Sciences du Numérique de Nantes (LS2N) Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT) ELRA ANR-12-CORD-0029,TermITH,TERMinologie et Indexation de Textes en sciences Humaines(2012) |
author_facet | Hazem, Amir Daille, Béatrice |
author_sort | Hazem, Amir |
collection | Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) |
description | International audience Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc. While different studies addressed the task of word synonym extraction, only a few investigations tackled the problem of acquiring synonyms of multi-word terms (MWT) from specialized corpora. To extract pairs of synonyms of multi-word terms, we propose in this paper an unsupervised semi-compositional method that makes use of distributional semantics and exploit the compositional property shared by most MWT. We show that our method outperforms significantly the state-of-the-art. |
format | Conference Object |
genre | Iceland |
genre_facet | Iceland |
id | ftccsdartic:oai:HAL:hal-01171093v1 |
institution | Open Polar |
language | English |
op_collection_id | ftccsdartic |
op_coverage | Reykjavik, Iceland |
op_relation | hal-01171093 https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf |
op_rights | info:eu-repo/semantics/OpenAccess |
op_source | 9th edition of the Language Resources and Evaluation Conference (LREC 2014) https://hal.science/hal-01171093 9th edition of the Language Resources and Evaluation Conference (LREC 2014), ELRA, May 2014, Reykjavik, Iceland |
publishDate | 2014 |
publisher | HAL CCSD |
record_format | openpolar |
spelling | ftccsdartic:oai:HAL:hal-01171093v1 2025-01-16T22:36:04+00:00 Semi-compositional Method for Synonym Extraction of Multi-Word Terms Hazem, Amir Daille, Béatrice Laboratoire d'Informatique de Nantes Atlantique (LINA) Mines Nantes (Mines Nantes)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-Centre National de la Recherche Scientifique (CNRS) Traitement Automatique du Langage Naturel (LS2N - équipe TALN ) Laboratoire des Sciences du Numérique de Nantes (LS2N) Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT) ELRA ANR-12-CORD-0029,TermITH,TERMinologie et Indexation de Textes en sciences Humaines(2012) Reykjavik, Iceland 2014-05-26 https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf en eng HAL CCSD hal-01171093 https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf info:eu-repo/semantics/OpenAccess 9th edition of the Language Resources and Evaluation Conference (LREC 2014) https://hal.science/hal-01171093 9th edition of the Language Resources and Evaluation Conference (LREC 2014), ELRA, May 2014, Reykjavik, Iceland Synonyms Multi-word terms Compositionality Distributional semantics Unsupervised methods [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] info:eu-repo/semantics/conferenceObject Conference papers 2014 ftccsdartic 2024-01-06T23:42:11Z International audience Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc. While different studies addressed the task of word synonym extraction, only a few investigations tackled the problem of acquiring synonyms of multi-word terms (MWT) from specialized corpora. To extract pairs of synonyms of multi-word terms, we propose in this paper an unsupervised semi-compositional method that makes use of distributional semantics and exploit the compositional property shared by most MWT. We show that our method outperforms significantly the state-of-the-art. Conference Object Iceland Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) |
spellingShingle | Synonyms Multi-word terms Compositionality Distributional semantics Unsupervised methods [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] Hazem, Amir Daille, Béatrice Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title | Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_full | Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_fullStr | Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_full_unstemmed | Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_short | Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
title_sort | semi-compositional method for synonym extraction of multi-word terms |
topic | Synonyms Multi-word terms Compositionality Distributional semantics Unsupervised methods [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] |
topic_facet | Synonyms Multi-word terms Compositionality Distributional semantics Unsupervised methods [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] |
url | https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf |