Semi-compositional Method for Synonym Extraction of Multi-Word Terms

International audience Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc. While different studies addressed the task of word synonym extraction, only a few inv...

Full description

Bibliographic Details
Main Authors: Hazem, Amir, Daille, Béatrice
Other Authors: Laboratoire d'Informatique de Nantes Atlantique (LINA), Mines Nantes (Mines Nantes)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST), Université de Nantes (UN)-Université de Nantes (UN)-Centre National de la Recherche Scientifique (CNRS), Traitement Automatique du Langage Naturel (LS2N - équipe TALN ), Laboratoire des Sciences du Numérique de Nantes (LS2N), Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST), Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST), Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT), ELRA, ANR-12-CORD-0029,TermITH,TERMinologie et Indexation de Textes en sciences Humaines(2012)
Format: Conference Object
Language:English
Published: HAL CCSD 2014
Subjects:
Online Access:https://hal.science/hal-01171093
https://hal.science/hal-01171093/document
https://hal.science/hal-01171093/file/679_Paper.pdf
id ftanrparis:oai:HAL:hal-01171093v1
record_format openpolar
spelling ftanrparis:oai:HAL:hal-01171093v1 2024-02-04T10:01:28+01:00 Semi-compositional Method for Synonym Extraction of Multi-Word Terms Hazem, Amir Daille, Béatrice Laboratoire d'Informatique de Nantes Atlantique (LINA) Mines Nantes (Mines Nantes)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-Centre National de la Recherche Scientifique (CNRS) Traitement Automatique du Langage Naturel (LS2N - équipe TALN ) Laboratoire des Sciences du Numérique de Nantes (LS2N) Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST) Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT) ELRA ANR-12-CORD-0029,TermITH,TERMinologie et Indexation de Textes en sciences Humaines(2012) Reykjavik, Iceland 2014-05-26 https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf en eng HAL CCSD hal-01171093 https://hal.science/hal-01171093 https://hal.science/hal-01171093/document https://hal.science/hal-01171093/file/679_Paper.pdf info:eu-repo/semantics/OpenAccess 9th edition of the Language Resources and Evaluation Conference (LREC 2014) https://hal.science/hal-01171093 9th edition of the Language Resources and Evaluation Conference (LREC 2014), ELRA, May 2014, Reykjavik, Iceland Synonyms Multi-word terms Compositionality Distributional semantics Unsupervised methods [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] info:eu-repo/semantics/conferenceObject Conference papers 2014 ftanrparis 2024-01-06T22:27:59Z International audience Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc. While different studies addressed the task of word synonym extraction, only a few investigations tackled the problem of acquiring synonyms of multi-word terms (MWT) from specialized corpora. To extract pairs of synonyms of multi-word terms, we propose in this paper an unsupervised semi-compositional method that makes use of distributional semantics and exploit the compositional property shared by most MWT. We show that our method outperforms significantly the state-of-the-art. Conference Object Iceland Portail HAL-ANR (Agence Nationale de la Recherche)
institution Open Polar
collection Portail HAL-ANR (Agence Nationale de la Recherche)
op_collection_id ftanrparis
language English
topic Synonyms
Multi-word terms
Compositionality
Distributional semantics
Unsupervised methods
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
spellingShingle Synonyms
Multi-word terms
Compositionality
Distributional semantics
Unsupervised methods
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Hazem, Amir
Daille, Béatrice
Semi-compositional Method for Synonym Extraction of Multi-Word Terms
topic_facet Synonyms
Multi-word terms
Compositionality
Distributional semantics
Unsupervised methods
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
description International audience Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc. While different studies addressed the task of word synonym extraction, only a few investigations tackled the problem of acquiring synonyms of multi-word terms (MWT) from specialized corpora. To extract pairs of synonyms of multi-word terms, we propose in this paper an unsupervised semi-compositional method that makes use of distributional semantics and exploit the compositional property shared by most MWT. We show that our method outperforms significantly the state-of-the-art.
author2 Laboratoire d'Informatique de Nantes Atlantique (LINA)
Mines Nantes (Mines Nantes)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST)
Université de Nantes (UN)-Université de Nantes (UN)-Centre National de la Recherche Scientifique (CNRS)
Traitement Automatique du Langage Naturel (LS2N - équipe TALN )
Laboratoire des Sciences du Numérique de Nantes (LS2N)
Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST)
Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique)
Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST)
Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)
ELRA
ANR-12-CORD-0029,TermITH,TERMinologie et Indexation de Textes en sciences Humaines(2012)
format Conference Object
author Hazem, Amir
Daille, Béatrice
author_facet Hazem, Amir
Daille, Béatrice
author_sort Hazem, Amir
title Semi-compositional Method for Synonym Extraction of Multi-Word Terms
title_short Semi-compositional Method for Synonym Extraction of Multi-Word Terms
title_full Semi-compositional Method for Synonym Extraction of Multi-Word Terms
title_fullStr Semi-compositional Method for Synonym Extraction of Multi-Word Terms
title_full_unstemmed Semi-compositional Method for Synonym Extraction of Multi-Word Terms
title_sort semi-compositional method for synonym extraction of multi-word terms
publisher HAL CCSD
publishDate 2014
url https://hal.science/hal-01171093
https://hal.science/hal-01171093/document
https://hal.science/hal-01171093/file/679_Paper.pdf
op_coverage Reykjavik, Iceland
genre Iceland
genre_facet Iceland
op_source 9th edition of the Language Resources and Evaluation Conference (LREC 2014)
https://hal.science/hal-01171093
9th edition of the Language Resources and Evaluation Conference (LREC 2014), ELRA, May 2014, Reykjavik, Iceland
op_relation hal-01171093
https://hal.science/hal-01171093
https://hal.science/hal-01171093/document
https://hal.science/hal-01171093/file/679_Paper.pdf
op_rights info:eu-repo/semantics/OpenAccess
_version_ 1789967386355433472