Towards Unsupervised Morphological Analysis of Polysynthetic Languages ...

Polysynthetic languages present a challenge for morphological analysis due to the complexity of their words and the lack of high-quality annotated datasets needed to build and/or evaluate computational models. The contribution of this work is twofold. First, using linguists’ help, we generate and co...

Full description

Bibliographic Details
Main Authors: Association for Computational Linguistics 2022, Khandagale, Sujay
Format: Article in Journal/Newspaper
Language:unknown
Published: Underline Science Inc. 2022
Subjects:
Online Access:https://dx.doi.org/10.48448/8m5w-vq37
https://underline.io/lecture/64698-towards-unsupervised-morphological-analysis-of-polysynthetic-languages
Description
Summary:Polysynthetic languages present a challenge for morphological analysis due to the complexity of their words and the lack of high-quality annotated datasets needed to build and/or evaluate computational models. The contribution of this work is twofold. First, using linguists’ help, we generate and contribute high-quality annotated data for two low-resource polysynthetic languages for two tasks: morphological segmentation and part-of-speech (POS) tagging. Second, we present the results of state-of-the-art unsupervised approaches for these two tasks on Adyghe and Inuktitut. Our findings show that for these polysynthetic languages, using linguistic priors helps the task of morphological segmentation and that using stems rather than words as the core unit of abstraction leads to superior performance on POS tagging. To view the proceedings, please click here (https://aclanthology.org/2022.aacl-short.41/) ...