LIA-korpusa – eldre talemålsopptak for norsk og samisk gjort tilgjengelege
This paper presents the results from the project Language Infrastructure made Accessible (LIA) which had as its main goal to digitize and make accessible old recordings of spoken Norwegian and Sámi from various archives, first and foremost from the four partner institutions University of Oslo, Unive...
Published in: | Nordlyd |
---|---|
Main Authors: | , |
Format: | Article in Journal/Newspaper |
Language: | Norwegian Nynorsk |
Published: |
Septentrio Academic Publishing
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10037/32487 https://doi.org/10.7557/12.7157 |
Summary: | This paper presents the results from the project Language Infrastructure made Accessible (LIA) which had as its main goal to digitize and make accessible old recordings of spoken Norwegian and Sámi from various archives, first and foremost from the four partner institutions University of Oslo, University of Bergen, The Norwegian University of Science and Technology, and UiT The Arctic University of Norway. The infrastructures resulting from the project can be summarized as 1) various language technology resources such as a morphological tagger and a parser for Norwegian dialects, upgrading of the corpus interface Glossa and a new infrastructure for file depots, 2) a file depot of Norwegian dialect recordings, 3) three corpora of spoken Norwegian and one for North Sámi as well as the LIA treebank. The paper exemplifies how the corpora can be utilized. |
---|