Generating and using probabilistic morphological resources for the biomedical domain
International audience In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin. The interpretation of these compounds are keystones to access information. In this paper, we present morphological...
Main Authors: | , |
---|---|
Other Authors: | , , , , , , |
Format: | Other/Unknown Material |
Language: | English |
Published: |
HAL CCSD
2014
|
Subjects: | |
Online Access: | https://hal.archives-ouvertes.fr/hal-01027778/file/Claveau_Kijak_LREC14.pdf https://hal.archives-ouvertes.fr/hal-01027778 |
id |
fttriple:oai:gotriple.eu:10670/1.ohg6o6 |
---|---|
record_format |
openpolar |
spelling |
fttriple:oai:gotriple.eu:10670/1.ohg6o6 2023-05-15T16:49:17+02:00 Generating and using probabilistic morphological resources for the biomedical domain Claveau, Vincent Kijak, Ewa Multimedia content-based indexing (TEXMEX) Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA) Université de Rennes 1 (UR1) Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes 1 (UR1) Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Rennes – Bretagne Atlantique Institut National de Recherche en Informatique et en Automatique (Inria) Reykjavik, Iceland 2014-05-26 https://hal.archives-ouvertes.fr/hal-01027778/file/Claveau_Kijak_LREC14.pdf https://hal.archives-ouvertes.fr/hal-01027778 en eng HAL CCSD hal-01027778 10670/1.ohg6o6 https://hal.archives-ouvertes.fr/hal-01027778/file/Claveau_Kijak_LREC14.pdf https://hal.archives-ouvertes.fr/hal-01027778 other Hyper Article en Ligne - Sciences de l'Homme et de la Société Proceedings of the 9th edition of the Language Resources and Evaluation Conference, LREC 2014 9th edition of the Language Resources and Evaluation Conference, LREC 2014 9th edition of the Language Resources and Evaluation Conference, LREC 2014, May 2014, Reykjavik, Iceland. 7 p Morpho-semantic analysis biomedical terminology probabilistic morphological resources lang anthro-se Conference Output https://vocabularies.coar-repositories.org/resource_types/c_c94f/ 2014 fttriple 2023-01-22T17:06:11Z International audience In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin. The interpretation of these compounds are keystones to access information. In this paper, we present morphological resources aiming at coping with these biomedical morphological compounds. Following previous work (Claveau and Kijak, 2011; Claveau, 2012), these resources are automatically built using Japanese terms in Kanjis as a pivot language and alignment techniques. We show how these alignment information can be used for segmenting compounds, attaching semantic interpretation to each part, proposing definitions (gloses) of the compounds. When possible, these tasks are compared with state-of-the-art tools, and the results show the interest of our automatically built probabilistic resources. Other/Unknown Material Iceland Unknown Pivot ENVELOPE(-30.239,-30.239,-80.667,-80.667) |
institution |
Open Polar |
collection |
Unknown |
op_collection_id |
fttriple |
language |
English |
topic |
Morpho-semantic analysis biomedical terminology probabilistic morphological resources lang anthro-se |
spellingShingle |
Morpho-semantic analysis biomedical terminology probabilistic morphological resources lang anthro-se Claveau, Vincent Kijak, Ewa Generating and using probabilistic morphological resources for the biomedical domain |
topic_facet |
Morpho-semantic analysis biomedical terminology probabilistic morphological resources lang anthro-se |
description |
International audience In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin. The interpretation of these compounds are keystones to access information. In this paper, we present morphological resources aiming at coping with these biomedical morphological compounds. Following previous work (Claveau and Kijak, 2011; Claveau, 2012), these resources are automatically built using Japanese terms in Kanjis as a pivot language and alignment techniques. We show how these alignment information can be used for segmenting compounds, attaching semantic interpretation to each part, proposing definitions (gloses) of the compounds. When possible, these tasks are compared with state-of-the-art tools, and the results show the interest of our automatically built probabilistic resources. |
author2 |
Multimedia content-based indexing (TEXMEX) Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA) Université de Rennes 1 (UR1) Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes 1 (UR1) Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Rennes – Bretagne Atlantique Institut National de Recherche en Informatique et en Automatique (Inria) |
format |
Other/Unknown Material |
author |
Claveau, Vincent Kijak, Ewa |
author_facet |
Claveau, Vincent Kijak, Ewa |
author_sort |
Claveau, Vincent |
title |
Generating and using probabilistic morphological resources for the biomedical domain |
title_short |
Generating and using probabilistic morphological resources for the biomedical domain |
title_full |
Generating and using probabilistic morphological resources for the biomedical domain |
title_fullStr |
Generating and using probabilistic morphological resources for the biomedical domain |
title_full_unstemmed |
Generating and using probabilistic morphological resources for the biomedical domain |
title_sort |
generating and using probabilistic morphological resources for the biomedical domain |
publisher |
HAL CCSD |
publishDate |
2014 |
url |
https://hal.archives-ouvertes.fr/hal-01027778/file/Claveau_Kijak_LREC14.pdf https://hal.archives-ouvertes.fr/hal-01027778 |
op_coverage |
Reykjavik, Iceland |
long_lat |
ENVELOPE(-30.239,-30.239,-80.667,-80.667) |
geographic |
Pivot |
geographic_facet |
Pivot |
genre |
Iceland |
genre_facet |
Iceland |
op_source |
Hyper Article en Ligne - Sciences de l'Homme et de la Société Proceedings of the 9th edition of the Language Resources and Evaluation Conference, LREC 2014 9th edition of the Language Resources and Evaluation Conference, LREC 2014 9th edition of the Language Resources and Evaluation Conference, LREC 2014, May 2014, Reykjavik, Iceland. 7 p |
op_relation |
hal-01027778 10670/1.ohg6o6 https://hal.archives-ouvertes.fr/hal-01027778/file/Claveau_Kijak_LREC14.pdf https://hal.archives-ouvertes.fr/hal-01027778 |
op_rights |
other |
_version_ |
1766039442388156416 |