A Free/Open-Source Morphological Analyser and Generator for Sakha

We present, to our knowledge, the first ever published morphological analyser and generator for Sakha, a marginalised language of Siberia. The transducer, developed using HFST, has coverage of solidly above 90%, and high precision. In the development of the analyser, we have expanded linguistic know...

Full description

Bibliographic Details
Main Authors: Ivanova, Sardana, Washington, Jonathan, Tyers, Francis M.
Other Authors: Department of Computer Science
Format: Conference Object
Language:English
Published: 2022
Subjects:
Online Access:http://hdl.handle.net/10138/347045
Description
Summary:We present, to our knowledge, the first ever published morphological analyser and generator for Sakha, a marginalised language of Siberia. The transducer, developed using HFST, has coverage of solidly above 90%, and high precision. In the development of the analyser, we have expanded linguistic knowledge about Sakha, and developed strategies for complex grammatical patterns. The transducer is already being used in downstream tasks, including computer assisted language learning applications for linguistic maintenance and computational linguistic shared tasks. Peer reviewed