Archive Infrastructure and Spoken Language Corpora for Saami Languages in Finland

Bibliographic Details
Main Authors: Jouste, Marko, Mettovaara, Jukka, Morottaja, Petter, Partanen, Niko
Other Authors: Berglund, Karl, La Mela, Matti, Zwart, Inge, The National Library of Finland, Library Network Services
Format: Conference Object
Language: English
Published: 2022
Subjects:
Online Access: http://hdl.handle.net/10138/350322
Description
Summary: This study presents the results of an Aanaar Saami pilot project in the Saami Culture Archive, University of Oulu. The project has established a set of conventions for transcribing and annotating the Aanaar Saami recordings in the archive's collection and created a mechanism through which grammatically annotated but anonymous versions can be imported to the Korp search interface in the Language Bank of Finland. The practices make wide use of Saami language technology and of Finnish computational research infrastructure, and they can later be extended to other Saami languages in the archive. Peer reviewed.