Towards Unsupervised Morphological Analysis of Polysynthetic Languages ...

Polysynthetic languages present a challenge for morphological analysis due to the complexity of their words and the lack of high-quality annotated datasets needed to build and/or evaluate computational models. The contribution of this work is twofold. First, using linguists’ help, we generate and co...

Full description

Bibliographic Details
Main Authors: Association for Computational Linguistics 2022, Khandagale, Sujay
Format: Article in Journal/Newspaper
Language:unknown
Published: Underline Science Inc. 2022
Subjects:
Online Access:https://dx.doi.org/10.48448/8m5w-vq37
https://underline.io/lecture/64698-towards-unsupervised-morphological-analysis-of-polysynthetic-languages
id ftdatacite:10.48448/8m5w-vq37
record_format openpolar
spelling ftdatacite:10.48448/8m5w-vq37 2023-05-15T16:55:35+02:00 Towards Unsupervised Morphological Analysis of Polysynthetic Languages ... Association for Computational Linguistics 2022 Khandagale, Sujay 2022 https://dx.doi.org/10.48448/8m5w-vq37 https://underline.io/lecture/64698-towards-unsupervised-morphological-analysis-of-polysynthetic-languages unknown Underline Science Inc. Natural Language Processing MediaObject Audiovisual article Conference talk 2022 ftdatacite https://doi.org/10.48448/8m5w-vq37 2023-04-03T13:14:08Z Polysynthetic languages present a challenge for morphological analysis due to the complexity of their words and the lack of high-quality annotated datasets needed to build and/or evaluate computational models. The contribution of this work is twofold. First, using linguists’ help, we generate and contribute high-quality annotated data for two low-resource polysynthetic languages for two tasks: morphological segmentation and part-of-speech (POS) tagging. Second, we present the results of state-of-the-art unsupervised approaches for these two tasks on Adyghe and Inuktitut. Our findings show that for these polysynthetic languages, using linguistic priors helps the task of morphological segmentation and that using stems rather than words as the core unit of abstraction leads to superior performance on POS tagging. To view the proceedings, please click here (https://aclanthology.org/2022.aacl-short.41/) ... Article in Journal/Newspaper inuktitut DataCite Metadata Store (German National Library of Science and Technology)
institution Open Polar
collection DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id ftdatacite
language unknown
topic Natural Language Processing
spellingShingle Natural Language Processing
Association for Computational Linguistics 2022
Khandagale, Sujay
Towards Unsupervised Morphological Analysis of Polysynthetic Languages ...
topic_facet Natural Language Processing
description Polysynthetic languages present a challenge for morphological analysis due to the complexity of their words and the lack of high-quality annotated datasets needed to build and/or evaluate computational models. The contribution of this work is twofold. First, using linguists’ help, we generate and contribute high-quality annotated data for two low-resource polysynthetic languages for two tasks: morphological segmentation and part-of-speech (POS) tagging. Second, we present the results of state-of-the-art unsupervised approaches for these two tasks on Adyghe and Inuktitut. Our findings show that for these polysynthetic languages, using linguistic priors helps the task of morphological segmentation and that using stems rather than words as the core unit of abstraction leads to superior performance on POS tagging. To view the proceedings, please click here (https://aclanthology.org/2022.aacl-short.41/) ...
format Article in Journal/Newspaper
author Association for Computational Linguistics 2022
Khandagale, Sujay
author_facet Association for Computational Linguistics 2022
Khandagale, Sujay
author_sort Association for Computational Linguistics 2022
title Towards Unsupervised Morphological Analysis of Polysynthetic Languages ...
title_short Towards Unsupervised Morphological Analysis of Polysynthetic Languages ...
title_full Towards Unsupervised Morphological Analysis of Polysynthetic Languages ...
title_fullStr Towards Unsupervised Morphological Analysis of Polysynthetic Languages ...
title_full_unstemmed Towards Unsupervised Morphological Analysis of Polysynthetic Languages ...
title_sort towards unsupervised morphological analysis of polysynthetic languages ...
publisher Underline Science Inc.
publishDate 2022
url https://dx.doi.org/10.48448/8m5w-vq37
https://underline.io/lecture/64698-towards-unsupervised-morphological-analysis-of-polysynthetic-languages
genre inuktitut
genre_facet inuktitut
op_doi https://doi.org/10.48448/8m5w-vq37
_version_ 1766046584777211904