Generating and using probabilistic morphological resources for the biomedical domain

International audience. In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents, mainly originating from Greek or Latin. The interpretation of these compounds is a keystone for accessing information. In this paper, we present morphological...

Full description

Bibliographic Details
Main Authors: Claveau, Vincent, Kijak, Ewa
Other Authors: Multimedia content-based indexing (TEXMEX); Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA); Université de Rennes (UR); Institut National des Sciences Appliquées - Rennes (INSA Rennes); Institut National des Sciences Appliquées (INSA); Institut National de Recherche en Informatique et en Automatique (Inria); Centre National de la Recherche Scientifique (CNRS); Inria Rennes – Bretagne Atlantique
Format: Conference Object
Language: English
Published: HAL CCSD 2014
Subjects: Morpho-semantic analysis; biomedical terminology; probabilistic morphological resources
Online Access: https://hal.science/hal-01027778
https://hal.science/hal-01027778/document
https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf
id ftinsarennhal:oai:HAL:hal-01027778v1
record_format openpolar
spelling ftinsarennhal:oai:HAL:hal-01027778v1 2023-05-15T16:49:23+02:00 Generating and using probabilistic morphological resources for the biomedical domain Claveau, Vincent Kijak, Ewa Multimedia content-based indexing (TEXMEX) Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA) Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Rennes – Bretagne Atlantique Institut National de Recherche en Informatique et en Automatique (Inria) Reykjavik, Iceland 2014-05-26 https://hal.science/hal-01027778 https://hal.science/hal-01027778/document https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf en eng HAL CCSD hal-01027778 https://hal.science/hal-01027778 https://hal.science/hal-01027778/document https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf info:eu-repo/semantics/OpenAccess Proceedings of the 9th edition of the Language Resources and Evaluation Conference, LREC 2014 9th edition of the Language Resources and Evaluation Conference, LREC 2014 https://hal.science/hal-01027778 9th edition of the Language Resources and Evaluation Conference, LREC 2014, May 2014, Reykjavik, Iceland. 
7 p Morpho-semantic analysis biomedical terminology probabilistic morphological resources [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] [SCCO.LING]Cognitive science/Linguistics info:eu-repo/semantics/conferenceObject Conference papers 2014 ftinsarennhal 2023-03-15T18:21:43Z International audience. In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents, mainly originating from Greek or Latin. The interpretation of these compounds is a keystone for accessing information. In this paper, we present morphological resources designed to cope with these biomedical morphological compounds. Following previous work (Claveau and Kijak, 2011; Claveau, 2012), these resources are built automatically using alignment techniques, with Japanese terms written in kanji serving as a pivot. We show how this alignment information can be used for segmenting compounds, attaching a semantic interpretation to each part, and proposing definitions (glosses) of the compounds. When possible, these tasks are compared with state-of-the-art tools, and the results show the value of our automatically built probabilistic resources. Conference Object Iceland INSA Rennes HAL (Institut National des Sciences Appliquées) Pivot ENVELOPE(-30.239,-30.239,-80.667,-80.667)
institution Open Polar
collection INSA Rennes HAL (Institut National des Sciences Appliquées)
op_collection_id ftinsarennhal
language English
topic Morpho-semantic analysis
biomedical terminology
probabilistic morphological resources
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
[SCCO.LING]Cognitive science/Linguistics
spellingShingle Morpho-semantic analysis
biomedical terminology
probabilistic morphological resources
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
[SCCO.LING]Cognitive science/Linguistics
Claveau, Vincent
Kijak, Ewa
Generating and using probabilistic morphological resources for the biomedical domain
topic_facet Morpho-semantic analysis
biomedical terminology
probabilistic morphological resources
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
[SCCO.LING]Cognitive science/Linguistics
description International audience. In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents, mainly originating from Greek or Latin. The interpretation of these compounds is a keystone for accessing information. In this paper, we present morphological resources designed to cope with these biomedical morphological compounds. Following previous work (Claveau and Kijak, 2011; Claveau, 2012), these resources are built automatically using alignment techniques, with Japanese terms written in kanji serving as a pivot. We show how this alignment information can be used for segmenting compounds, attaching a semantic interpretation to each part, and proposing definitions (glosses) of the compounds. When possible, these tasks are compared with state-of-the-art tools, and the results show the value of our automatically built probabilistic resources.
author2 Multimedia content-based indexing (TEXMEX)
Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA)
Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes)
Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes)
Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Rennes – Bretagne Atlantique
Institut National de Recherche en Informatique et en Automatique (Inria)
format Conference Object
author Claveau, Vincent
Kijak, Ewa
author_facet Claveau, Vincent
Kijak, Ewa
author_sort Claveau, Vincent
title Generating and using probabilistic morphological resources for the biomedical domain
title_short Generating and using probabilistic morphological resources for the biomedical domain
title_full Generating and using probabilistic morphological resources for the biomedical domain
title_fullStr Generating and using probabilistic morphological resources for the biomedical domain
title_full_unstemmed Generating and using probabilistic morphological resources for the biomedical domain
title_sort generating and using probabilistic morphological resources for the biomedical domain
publisher HAL CCSD
publishDate 2014
url https://hal.science/hal-01027778
https://hal.science/hal-01027778/document
https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf
op_coverage Reykjavik, Iceland
long_lat ENVELOPE(-30.239,-30.239,-80.667,-80.667)
geographic Pivot
geographic_facet Pivot
genre Iceland
genre_facet Iceland
op_source Proceedings of the 9th edition of the Language Resources and Evaluation Conference, LREC 2014
9th edition of the Language Resources and Evaluation Conference, LREC 2014
https://hal.science/hal-01027778
9th edition of the Language Resources and Evaluation Conference, LREC 2014, May 2014, Reykjavik, Iceland. 7 p
op_relation hal-01027778
https://hal.science/hal-01027778
https://hal.science/hal-01027778/document
https://hal.science/hal-01027778/file/Claveau_Kijak_LREC14.pdf
op_rights info:eu-repo/semantics/OpenAccess
_version_ 1766039530944593920