LIA-korpusa – eldre talemålsopptak for norsk og samisk gjort tilgjengelege

This paper presents the results from the project Language Infrastructure made Accessible (LIA) which had as its main goal to digitize and make accessible old recordings of spoken Norwegian and Sámi from various archives, first and foremost from the four partner institutions University of Oslo, Unive...

Full description

Bibliographic Details
Published in:Nordlyd
Main Authors: Kristin Hagen, Øystein A. Vangsnes
Format: Article in Journal/Newspaper
Language:English
Norwegian
Published: Septentrio Academic Publishing 2023
Subjects:
Online Access:https://doi.org/10.7557/12.7157
https://doaj.org/article/8bf38b900c4d43e0953d8fabdd6a02ae
Description
Summary:This paper presents the results from the project Language Infrastructure made Accessible (LIA) which had as its main goal to digitize and make accessible old recordings of spoken Norwegian and Sámi from various archives, first and foremost from the four partner institutions University of Oslo, University of Bergen, The Norwegian University of Science and Technology, and UiT The Arctic University of Norway. The infra­structures resulting from the project can be summarized as 1) a file depot of Norwegian dialect recordings, 2) three corpora of spoken Norwegian and one for North Sámi, 3) various language technology resources such as an openly available tree bank, a morphological taggar and a parser for Norwegian dialects, upgrading of the corpus interface Glossa and more. The paper exemplifies how the corpus can be utilized.