Archive infrastructure and spoken language corpora for Saami languages in Finland

Abstract This study presents the results of an Aanaar Saami pilot project in the Saami Culture Archive, University of Oulu. The project has established a set of conventions to transcribe and annotate Aanaar Saami recordings in the archive’s collection and created a mechanism through which grammatica...

Full description

Bibliographic Details
Main Authors: Jouste, M. (Marko), Mettovaara, J. (Jukka), Morottaja, P. (Petter), Partanen, N. (Niko)
Format: Conference Object
Language:English
Published: RWTH Aachen University 2022
Subjects:
Online Access:http://urn.fi/urn:nbn:fi-fe2022102062652
Description
Summary:Abstract This study presents the results of an Aanaar Saami pilot project in the Saami Culture Archive, University of Oulu. The project has established a set of conventions to transcribe and annotate Aanaar Saami recordings in the archive’s collection and created a mechanism through which grammatically annotated but anonymous versions can be imported to the Korp search interface in the Language Bank of Finland. The practices include wide use of Saami language technology, the use of Finnish computational research infrastructure, and they can be extended later to other Saami languages in the archive.