Semantic clustering of pivot paraphrases

International audience Paraphrases extracted from parallel corpora by the pivot method (Bannard and Callison-Burch, 2005) constitute a valuable resource for multilingual NLP applications. In this study, we analyse the semantics of unigram pivot paraphrases and use a graph-based sense induction appro...

Full description

Bibliographic Details
Main Authors: Apidianaki, Marianna, Verzeni, Emilia, Mccarthy, Diana
Other Authors: Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI), Université Paris-Sud - Paris 11 (UP11)-Sorbonne Université - UFR d'Ingénierie (UFR 919), Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Paris Saclay (COmUE)
Format: Conference Object
Language:English
Published: HAL CCSD 2014
Subjects:
Online Access:https://hal.science/hal-01838559
id ftsorbonneuniv:oai:HAL:hal-01838559v1
record_format openpolar
spelling ftsorbonneuniv:oai:HAL:hal-01838559v1 2023-11-05T03:42:53+01:00 Semantic clustering of pivot paraphrases Apidianaki, Marianna Verzeni, Emilia Mccarthy, Diana Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI) Université Paris-Sud - Paris 11 (UP11)-Sorbonne Université - UFR d'Ingénierie (UFR 919) Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Paris Saclay (COmUE) Reykjavik, Iceland 2014-01-01 https://hal.science/hal-01838559 en eng HAL CCSD hal-01838559 https://hal.science/hal-01838559 International Conference on Language Resources and Evaluation https://hal.science/hal-01838559 International Conference on Language Resources and Evaluation, Jan 2014, Reykjavik, Iceland pivot paraphrasing sense clustering parallel corpora [INFO]Computer Science [cs] [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] info:eu-repo/semantics/conferenceObject Conference papers 2014 ftsorbonneuniv 2023-10-10T23:58:35Z International audience Paraphrases extracted from parallel corpora by the pivot method (Bannard and Callison-Burch, 2005) constitute a valuable resource for multilingual NLP applications. In this study, we analyse the semantics of unigram pivot paraphrases and use a graph-based sense induction approach to unveil hidden sense distinctions in the paraphrase sets. The comparison of the acquired senses to gold data from the Lexical Substitution shared task (McCarthy and Navigli, 2007) demonstrates that sense distinctions exist in the paraphrase sets and highlights the need for a disambiguation step in applications using this resource. Conference Object Iceland HAL Sorbonne Université
institution Open Polar
collection HAL Sorbonne Université
op_collection_id ftsorbonneuniv
language English
topic pivot paraphrasing
sense clustering
parallel corpora
[INFO]Computer Science [cs]
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
spellingShingle pivot paraphrasing
sense clustering
parallel corpora
[INFO]Computer Science [cs]
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Apidianaki, Marianna
Verzeni, Emilia
Mccarthy, Diana
Semantic clustering of pivot paraphrases
topic_facet pivot paraphrasing
sense clustering
parallel corpora
[INFO]Computer Science [cs]
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
description International audience Paraphrases extracted from parallel corpora by the pivot method (Bannard and Callison-Burch, 2005) constitute a valuable resource for multilingual NLP applications. In this study, we analyse the semantics of unigram pivot paraphrases and use a graph-based sense induction approach to unveil hidden sense distinctions in the paraphrase sets. The comparison of the acquired senses to gold data from the Lexical Substitution shared task (McCarthy and Navigli, 2007) demonstrates that sense distinctions exist in the paraphrase sets and highlights the need for a disambiguation step in applications using this resource.
author2 Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI)
Université Paris-Sud - Paris 11 (UP11)-Sorbonne Université - UFR d'Ingénierie (UFR 919)
Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Paris Saclay (COmUE)
format Conference Object
author Apidianaki, Marianna
Verzeni, Emilia
Mccarthy, Diana
author_facet Apidianaki, Marianna
Verzeni, Emilia
Mccarthy, Diana
author_sort Apidianaki, Marianna
title Semantic clustering of pivot paraphrases
title_short Semantic clustering of pivot paraphrases
title_full Semantic clustering of pivot paraphrases
title_fullStr Semantic clustering of pivot paraphrases
title_full_unstemmed Semantic clustering of pivot paraphrases
title_sort semantic clustering of pivot paraphrases
publisher HAL CCSD
publishDate 2014
url https://hal.science/hal-01838559
op_coverage Reykjavik, Iceland
genre Iceland
genre_facet Iceland
op_source International Conference on Language Resources and Evaluation
https://hal.science/hal-01838559
International Conference on Language Resources and Evaluation, Jan 2014, Reykjavik, Iceland
op_relation hal-01838559
https://hal.science/hal-01838559
_version_ 1781700452311629824