Turning language documentation into reader’s and writer’s software tools

One avenue for supporting the continued use and revitalization of endangered languages in the current, pervasively computerized world is the creation of computational models of the often rich and complex morphology of these languages. Such computational models can be used as a basis for creating a s...

Bibliographic Details
Main Authors: Arppe, Antti, Antonsen, Lene, Trosterud, Trond, Moshagen, Sjur, Thunder, Dorothy, Snoek, Conor, Mills, Timothy, Järvikivi, Juhani, Lachler, Jordan
Language: unknown
Published: 2015
Online Access: http://hdl.handle.net/10125/25317
id ftunivhawaiimano:oai:scholarspace.manoa.hawaii.edu:10125/25317
record_format openpolar
spelling ftunivhawaiimano:oai:scholarspace.manoa.hawaii.edu:10125/25317 2023-05-15T16:32:33+02:00 Turning language documentation into reader’s and writer’s software tools Arppe, Antti Antonsen, Lene Trosterud, Trond Moshagen, Sjur Thunder, Dorothy Snoek, Conor Mills, Timothy Järvikivi, Juhani Lachler, Jordan 2015-03-12 audio/mpeg http://hdl.handle.net/10125/25317 unknown Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported CC-BY-NC-SA 2015 ftunivhawaiimano 2022-07-17T13:11:42Z Other/Unknown Material haida ScholarSpace at University of Hawaii at Manoa
institution Open Polar
collection ScholarSpace at University of Hawaii at Manoa
op_collection_id ftunivhawaiimano
language unknown
description One avenue for supporting the continued use and revitalization of endangered languages in the current, pervasively computerized world is the creation of computational models of the often rich and complex morphology of these languages. Such computational models can be used as a basis for creating a suite of reader’s and writer’s tools, including: (1) an intelligent electronic dictionary that combines the computational model with a lexical database, so that any inflected form can be linked to the appropriate dictionary entry and word paradigms can be generated; (2) an intelligent computer-aided language learning (ICALL) application that dynamically generates large numbers of exercises by combining the entire core vocabulary (up to several thousand of the most common words) with a substantially smaller set of exercise templates; and (3) a spell-checker that supports adherence to one or more existing orthographical conventions, and thus the production of good-quality texts. Importantly, these tools can be made publicly available over the Internet and integrated into general software applications such as web browsers and word processors, to be used at little or no cost by speakers and language learners in the respective communities, as well as by researchers anywhere, instead of remaining on an individual researcher’s computer drive or on a library bookshelf.
Based on our recent experiences in trying out various practical approaches to developing computational morphological models for Plains Cree and Northern Haida using Finite-State Transducer (FST) technology (Beesley & Karttunen, 2003), once one gains access to both (a) a comprehensive set of full word paradigms for every possible paradigm type and (b) an accompanying extensive electronic lexical resource coded for the relevant paradigm type, we have been able to create surprisingly rapidly, potentially within only several months, initial but already full-fledged FST models that can be readily adapted into ...
author Arppe, Antti
Antonsen, Lene
Trosterud, Trond
Moshagen, Sjur
Thunder, Dorothy
Snoek, Conor
Mills, Timothy
Järvikivi, Juhani
Lachler, Jordan
author_sort Arppe, Antti
title Turning language documentation into reader’s and writer’s software tools
publishDate 2015
url http://hdl.handle.net/10125/25317
genre haida
op_relation http://hdl.handle.net/10125/25317
op_rights Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported
op_rightsnorm CC-BY-NC-SA
_version_ 1766022305461305344