Finite-state description, developing mental awareness

In this article, we approach finite-state description practices that must be instilled in the developer. Thoughts are presented accompanied by reference to concrete experiences with different languages and their description. We contend that finite-state description of languages leads to development...

Bibliographic Details
Main Author: Rueter, Jack
Other Authors: Hurskainen, Arvi; Koskenniemi, Kimmo; Pirinen, Tommi; Department of Digital Humanities
Format: Article in Journal/Newspaper
Language: English
Published: 2023
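Subjects: 6121 Languages; finite-state morphology; regular morphology; Võro language; Lushootseed language; Moksha language; Erzya language; Komi-Zyrian language; Skolt Saami language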
Online Access: http://hdl.handle.net/10138/357046
id ftunivhelsihelda:oai:helda.helsinki.fi:10138/357046
record_format openpolar
institution Open Polar
collection HELDA – University of Helsinki Open Repository
op_collection_id ftunivhelsihelda
language English
topic 6121 Languages
finite-state morphology
regular morphology
Võro language
Lushootseed language
Moksha language
Erzya language
Komi-Zyrian language
Skolt Saami language
description In this article, we approach finite-state description practices that must be instilled in the developer. Thoughts are presented accompanied by reference to concrete experiences with different languages and their description. We contend that finite-state description of languages leads to development in the describer-developer. This presupposes regular interaction with developers of upstream and downstream technologies. And as more languages are described, the developer learns what to choose as a starting point, hopefully with the help of a researcher, research documentation or native speaker well versed in the workings of the language. We maintain that finite-state work should serve more than one purpose or audience, and that, as linguists, we should be raising the bar by applying the knowledge of research to description, so that our understanding of the linguistic phenomena can be attested by others or proven false. We are providing a methodology for repeatable experimentation and rule making. We see that each language provides something unique, while sharing some recognizable features with other languages. We stress the necessity to avoid generating characters from epsilons and offer examples where it is possible to write rules that reduce characters to epsilons instead. We also stress the need to describe the predictable infinite set of all native phenomena, whereas the unknown and random qualities introduced through language contact cannot form a foundation for our descriptions. Finally, we call for a playful approach to phenomena in a language, because that might bring us closer to how a child would learn the language – through repetition, mistakes and self-correction. Peer reviewed
author2 Hurskainen, Arvi
Koskenniemi, Kimmo
Pirinen, Tommi
Department of Digital Humanities
format Article in Journal/Newspaper
author Rueter, Jack
title Finite-state description, developing mental awareness
publishDate 2023
url http://hdl.handle.net/10138/357046
genre saami
op_relation Rule-Based Language Technology
NEALT Monograph Series
Rueter, J 2023, Finite-state description, developing mental awareness. In A Hurskainen, K Koskenniemi & T Pirinen (eds), Rule-Based Language Technology. NEALT Monograph Series, vol. 2[1], Northern European Association for Language Technology, Tartu, pp. 217-227.
ORCID: /0000-0002-3076-7929/work/133565965
629d578d-a4b7-4812-b996-02267c00982c
http://hdl.handle.net/10138/357046
op_rights unspecified
openAccess
info:eu-repo/semantics/openAccess
_version_ 1790607368891924480