Finite-state description, developing mental awareness

In this article, we approach finite-state description practices that must be instilled in the developer. Thoughts are presented accompanied by reference to concrete experiences with different languages and their description. We contend that finite-state description of languages leads to development...

Full description

Bibliographic Details
Main Author: Rueter, Jack
Other Authors: Hurskainen, Arvi, Koskenniemi, Kimmo, Pirinen, Tommi, Department of Digital Humanities
Format: Article in Journal/Newspaper
Language:English
Published: 2023
Subjects:
Online Access:http://hdl.handle.net/10138/357046
_version_ 1824237083583053824
author Rueter, Jack
author2 Hurskainen, Arvi
Koskenniemi, Kimmo
Pirinen, Tommi
Department of Digital Humanities
author_facet Rueter, Jack
author_sort Rueter, Jack
collection HELDA – University of Helsinki Open Repository
description In this article, we approach finite-state description practices that must be instilled in the developer. Thoughts are presented accompanied by reference to concrete experiences with different languages and their description. We contend that finite-state description of languages leads to development in the describer-developer. This presupposes regular interaction with developers of upstream and downstream technologies. And as more languages are described, the developer learns what to choose as a starting point, hopefully with the help of a researcher, research documentation or native speaker well versed in the workings of the language. We maintain that finite-state work should serve more than one purpose or audience, and that, as linguists, we should be raising the bar by applying the knowledge of research to description, so that our understanding of the linguistic phenomena can be attested by others or proven false. We are providing a methodology for repeatable experimentation and rule making. We see that each language provides something unique, while sharing some recognizable features with other languages. We stress the necessity to avoid generating characters from epsilons and offer examples where it is possible to write rules that reduce characters to epsilons instead. We also stress the need to describe the predictable infinite set of all native phenomena, whereas the unknown and random qualities introduced through language contact cannot form a foundation for our descriptions. Finally, we call for a playful approach to phenomena in a language, because that might bring us closer to how a child would learn the language – through repetition, mistakes and self-correction. Peer reviewed
format Article in Journal/Newspaper
genre saami
genre_facet saami
id ftunivhelsihelda:oai:helda.helsinki.fi:10138/357046
institution Open Polar
language English
op_collection_id ftunivhelsihelda
op_relation Rule-Based Language Technology
NEALT Monograph Series
Suomalais-Ugrilainen Seura
http://hdl.handle.net/10138/357046
op_rights unspecified
info:eu-repo/semantics/openAccess
openAccess
publishDate 2023
record_format openpolar
spelling ftunivhelsihelda:oai:helda.helsinki.fi:10138/357046 2025-02-16T15:09:44+00:00 Finite-state description, developing mental awareness Rueter, Jack Hurskainen, Arvi Koskenniemi, Kimmo Pirinen, Tommi Department of Digital Humanities 2023-04-17T21:09:51Z 11 application/pdf http://hdl.handle.net/10138/357046 eng eng Rule-Based Language Technology NEALT Monograph Series Suomalais-Ugrilainen Seura http://hdl.handle.net/10138/357046 unspecified info:eu-repo/semantics/openAccess openAccess Languages finite-state morphology regular morphology Võro language Lushootseed language Moksha language Erzya language Komi-Zyrian language Skolt Saami language Chapter publishedVersion 2023 ftunivhelsihelda 2025-01-21T16:11:31Z In this article, we approach finite-state description practices that must be instilled in the developer. Thoughts are presented accompanied by reference to concrete experiences with different languages and their description. We contend that finite-state description of languages leads to development in the describer-developer. This presupposes regular interaction with developers of upstream and downstream technologies. And as more languages are described, the developer learns what to choose as a starting point, hopefully with the help of a researcher, research documentation or native speaker well versed in the workings of the language. We maintain that finite-state work should serve more than one purpose or audience, and that, as linguists, we should be raising the bar by applying the knowledge of research to description, so that our understanding of the linguistic phenomena can be attested by others or proven false. We are providing a methodology for repeatable experimentation and rule making. We see that each language provides something unique, while sharing some recognizable features with other languages. We stress the necessity to avoid generating characters from epsilons and offer examples where it is possible to write rules that reduce characters to epsilons instead. We also stress the need to describe the predictable infinite set of all native phenomena, whereas the unknown and random qualities introduced through language contact cannot form a foundation for our descriptions. Finally, we call for a playful approach to phenomena in a language, because that might bring us closer to how a child would learn the language – through repetition, mistakes and self-correction. Peer reviewed Article in Journal/Newspaper saami HELDA – University of Helsinki Open Repository
spellingShingle Languages
finite-state morphology
regular morphology
Võro language
Lushootseed language
Moksha language
Erzya language
Komi-Zyrian language
Skolt Saami language
Rueter, Jack
Finite-state description, developing mental awareness
title Finite-state description, developing mental awareness
title_full Finite-state description, developing mental awareness
title_fullStr Finite-state description, developing mental awareness
title_full_unstemmed Finite-state description, developing mental awareness
title_short Finite-state description, developing mental awareness
title_sort finite-state description, developing mental awareness
topic Languages
finite-state morphology
regular morphology
Võro language
Lushootseed language
Moksha language
Erzya language
Komi-Zyrian language
Skolt Saami language
topic_facet Languages
finite-state morphology
regular morphology
Võro language
Lushootseed language
Moksha language
Erzya language
Komi-Zyrian language
Skolt Saami language
url http://hdl.handle.net/10138/357046