Generating and using probabilistic morphological resources for the biomedical domain
International audience In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin. The interpretation of these compounds are keystones to access information. In this paper, we present morphological...
Main Authors: | , |
---|---|
Other Authors: | , , , , , |
Format: | Conference Object |
Language: | English |
Published: |
HAL CCSD
2014
|
Subjects: | |
Online Access: | https://hal.science/hal-01027778 https://hal.science/hal-01027778/document https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf |
id |
ftunivrennes1hal:oai:HAL:hal-01027778v1 |
---|---|
record_format |
openpolar |
spelling |
ftunivrennes1hal:oai:HAL:hal-01027778v1 2023-05-15T16:49:28+02:00 Generating and using probabilistic morphological resources for the biomedical domain Claveau, Vincent Kijak, Ewa Multimedia content-based indexing (TEXMEX) Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA) Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Rennes – Bretagne Atlantique Institut National de Recherche en Informatique et en Automatique (Inria) Reykjavik, Iceland 2014-05-26 https://hal.science/hal-01027778 https://hal.science/hal-01027778/document https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf en eng HAL CCSD hal-01027778 https://hal.science/hal-01027778 https://hal.science/hal-01027778/document https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf info:eu-repo/semantics/OpenAccess Proceedings of the 9th edition of the Language Resources and Evaluation Conference, LREC 2014 9th edition of the Language Resources and Evaluation Conference, LREC 2014 https://hal.science/hal-01027778 9th edition of the Language Resources and Evaluation Conference, LREC 2014, May 2014, Reykjavik, Iceland. 7 p Morpho-semantic analysis biomedical terminology probabilistic morphological resources [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] [SCCO.LING]Cognitive science/Linguistics info:eu-repo/semantics/conferenceObject Conference papers 2014 ftunivrennes1hal 2023-03-22T00:20:03Z International audience In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin. The interpretation of these compounds are keystones to access information. In this paper, we present morphological resources aiming at coping with these biomedical morphological compounds. Following previous work (Claveau and Kijak, 2011; Claveau, 2012), these resources are automatically built using Japanese terms in Kanjis as a pivot language and alignment techniques. We show how these alignment information can be used for segmenting compounds, attaching semantic interpretation to each part, proposing definitions (gloses) of the compounds. When possible, these tasks are compared with state-of-the-art tools, and the results show the interest of our automatically built probabilistic resources. Conference Object Iceland Université de Rennes 1: Publications scientifiques (HAL) Pivot ENVELOPE(-30.239,-30.239,-80.667,-80.667) |
institution |
Open Polar |
collection |
Université de Rennes 1: Publications scientifiques (HAL) |
op_collection_id |
ftunivrennes1hal |
language |
English |
topic |
Morpho-semantic analysis biomedical terminology probabilistic morphological resources [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] [SCCO.LING]Cognitive science/Linguistics |
spellingShingle |
Morpho-semantic analysis biomedical terminology probabilistic morphological resources [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] [SCCO.LING]Cognitive science/Linguistics Claveau, Vincent Kijak, Ewa Generating and using probabilistic morphological resources for the biomedical domain |
topic_facet |
Morpho-semantic analysis biomedical terminology probabilistic morphological resources [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] [SCCO.LING]Cognitive science/Linguistics |
description |
International audience In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin. The interpretation of these compounds are keystones to access information. In this paper, we present morphological resources aiming at coping with these biomedical morphological compounds. Following previous work (Claveau and Kijak, 2011; Claveau, 2012), these resources are automatically built using Japanese terms in Kanjis as a pivot language and alignment techniques. We show how these alignment information can be used for segmenting compounds, attaching semantic interpretation to each part, proposing definitions (gloses) of the compounds. When possible, these tasks are compared with state-of-the-art tools, and the results show the interest of our automatically built probabilistic resources. |
author2 |
Multimedia content-based indexing (TEXMEX) Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA) Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Rennes – Bretagne Atlantique Institut National de Recherche en Informatique et en Automatique (Inria) |
format |
Conference Object |
author |
Claveau, Vincent Kijak, Ewa |
author_facet |
Claveau, Vincent Kijak, Ewa |
author_sort |
Claveau, Vincent |
title |
Generating and using probabilistic morphological resources for the biomedical domain |
title_short |
Generating and using probabilistic morphological resources for the biomedical domain |
title_full |
Generating and using probabilistic morphological resources for the biomedical domain |
title_fullStr |
Generating and using probabilistic morphological resources for the biomedical domain |
title_full_unstemmed |
Generating and using probabilistic morphological resources for the biomedical domain |
title_sort |
generating and using probabilistic morphological resources for the biomedical domain |
publisher |
HAL CCSD |
publishDate |
2014 |
url |
https://hal.science/hal-01027778 https://hal.science/hal-01027778/document https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf |
op_coverage |
Reykjavik, Iceland |
long_lat |
ENVELOPE(-30.239,-30.239,-80.667,-80.667) |
geographic |
Pivot |
geographic_facet |
Pivot |
genre |
Iceland |
genre_facet |
Iceland |
op_source |
Proceedings of the 9th edition of the Language Resources and Evaluation Conference, LREC 2014 9th edition of the Language Resources and Evaluation Conference, LREC 2014 https://hal.science/hal-01027778 9th edition of the Language Resources and Evaluation Conference, LREC 2014, May 2014, Reykjavik, Iceland. 7 p |
op_relation |
hal-01027778 https://hal.science/hal-01027778 https://hal.science/hal-01027778/document https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf |
op_rights |
info:eu-repo/semantics/OpenAccess |
_version_ |
1766039598362787840 |