INEL Nenets Corpus

Corpus Citation Budzisch, Josefina; Wagner-Nagy, Beáta. 2024. INEL Nenets Corpus. Version 1.0. Publication date 2024-12-31. https://hdl.handle.net/11022/0000-0007-FE37-E. Archived at Universität Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages. https://hdl.handle.net/11022/000...

Full description

Bibliographic Details
Main Authors:	Budzisch, Josefina, Wagner-Nagy, Beáta
Other Authors:	Arkhipov, Alexandre, Lazarenko, Elena, Riaposov, Aleksandr, Lehmberg, Timm
Format:	Dataset
Language:	unknown
Published:	2024
Subjects:	Uralic Samoyedic Nenets Forest Nenets Tundra Nenets endangered language language contact language documentation legacy data INEL AdWHH text corpus speech corpus parallel texts folklore tales narrative elicitation song transcription time-aligned audio morphological glossing part-of-speech borrowings code-switching existantial predication locative predication possessive predication English translation German translation Russian translation EXMARaLDA ELAN XML ISO/TEI nenets samoyed* Tundra
Online Access:	https://www.fdr.uni-hamburg.de/record/16518 https://doi.org/10.25592/uhhfdm.16518

_version_	1835017467841216512
author	Budzisch, Josefina Wagner-Nagy, Beáta
author2	Wagner-Nagy, Beáta Arkhipov, Alexandre Lazarenko, Elena Riaposov, Aleksandr Lehmberg, Timm
author_facet	Budzisch, Josefina Wagner-Nagy, Beáta
author_sort	Budzisch, Josefina
collection	Unknown
description	Corpus Citation Budzisch, Josefina; Wagner-Nagy, Beáta. 2024. INEL Nenets Corpus. Version 1.0. Publication date 2024-12-31. https://hdl.handle.net/11022/0000-0007-FE37-E. Archived at Universität Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages. https://hdl.handle.net/11022/0000-0007-F45A-1 Corpus Description The INEL Nenets corpus has been created within the long-term INEL project ("Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages"), 2016–2033. The corpus includes texts recorded between 1940–2011 in both Nenets lects – Forest Nenets and Tundra Nenets. The majority of texts in this corpus originate from published works, which are appropriately cited in the relevant sections of the metadata. In particular, the following publications were used, the full information can be found in the reference section of the documentation: Barmich 2018 Burkova 2008 Burkova 2012 Burkova et al. 2003 Hajdú 1968 Koshkareva et al. 2007 Labanauskas 2001 Logany & Logany 2016 Lyubinskaya 2022 Pusztay 1976 Tereshchenko 1956 Tereshchenko 1990 Turutina 2003 Yangasova 2018 Svetlana Burkova kindly shared a collection of her Forest Nenets data including an original sound recording (Agan dialect), transcripts and glosses as Toolbox files and Word documents (Agan and Pur dialects), as well as published texts in Pur (Turutina 2003) and Numto (Logany & Logany 2016) dialects. All texts in the corpus are provided with interlinear morpheme-by-morpheme glosses and translation into English, German and Russian. Audio recording is also provided for one text. Corpus size Forest Nenets: 80 texts, 3,709 sentences, 23,597 tokens Tundra Nenets: 56 texts, 6,545 sentences, 37,681 tokens Total: 136 texts, 10,254 sentences, 61,278 tokens Total duration of audio: 44 minutes 45 seconds Funding The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry ...
format	Dataset
genre	nenets samoyed* Tundra
genre_facet	nenets samoyed* Tundra
id	ftunihamburgdata:oai:fdr.uni-hamburg.de:16518
institution	Open Polar
language	unknown
op_collection_id	ftunihamburgdata
op_doi	https://doi.org/10.25592/uhhfdm.1651810.25592/uhhfdm.16517
op_relation	handle:11022/0000-0007-FE37-E doi:10.25592/uhhfdm.16517 doi:10.25592/uhhfdm.16518
op_rights	info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode
publishDate	2024
record_format	openpolar
spelling	ftunihamburgdata:oai:fdr.uni-hamburg.de:16518 2025-06-15T14:38:36+00:00 INEL Nenets Corpus Budzisch, Josefina Wagner-Nagy, Beáta Wagner-Nagy, Beáta Arkhipov, Alexandre Lazarenko, Elena Riaposov, Aleksandr Lehmberg, Timm 2024-12-31 https://www.fdr.uni-hamburg.de/record/16518 https://doi.org/10.25592/uhhfdm.16518 yrk unknown handle:11022/0000-0007-FE37-E doi:10.25592/uhhfdm.16517 doi:10.25592/uhhfdm.16518 info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode Uralic Samoyedic Nenets Forest Nenets Tundra Nenets endangered language language contact language documentation legacy data INEL AdWHH text corpus speech corpus parallel texts folklore tales narrative elicitation song transcription time-aligned audio morphological glossing part-of-speech borrowings code-switching existantial predication locative predication possessive predication English translation German translation Russian translation EXMARaLDA ELAN XML ISO/TEI info:eu-repo/semantics/other dataset 2024 ftunihamburgdata https://doi.org/10.25592/uhhfdm.1651810.25592/uhhfdm.16517 2025-05-19T03:13:49Z Corpus Citation Budzisch, Josefina; Wagner-Nagy, Beáta. 2024. INEL Nenets Corpus. Version 1.0. Publication date 2024-12-31. https://hdl.handle.net/11022/0000-0007-FE37-E. Archived at Universität Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages. https://hdl.handle.net/11022/0000-0007-F45A-1 Corpus Description The INEL Nenets corpus has been created within the long-term INEL project ("Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages"), 2016–2033. The corpus includes texts recorded between 1940–2011 in both Nenets lects – Forest Nenets and Tundra Nenets. The majority of texts in this corpus originate from published works, which are appropriately cited in the relevant sections of the metadata. In particular, the following publications were used, the full information can be found in the reference section of the documentation: Barmich 2018 Burkova 2008 Burkova 2012 Burkova et al. 2003 Hajdú 1968 Koshkareva et al. 2007 Labanauskas 2001 Logany & Logany 2016 Lyubinskaya 2022 Pusztay 1976 Tereshchenko 1956 Tereshchenko 1990 Turutina 2003 Yangasova 2018 Svetlana Burkova kindly shared a collection of her Forest Nenets data including an original sound recording (Agan dialect), transcripts and glosses as Toolbox files and Word documents (Agan and Pur dialects), as well as published texts in Pur (Turutina 2003) and Numto (Logany & Logany 2016) dialects. All texts in the corpus are provided with interlinear morpheme-by-morpheme glosses and translation into English, German and Russian. Audio recording is also provided for one text. Corpus size Forest Nenets: 80 texts, 3,709 sentences, 23,597 tokens Tundra Nenets: 56 texts, 6,545 sentences, 37,681 tokens Total: 136 texts, 10,254 sentences, 61,278 tokens Total duration of audio: 44 minutes 45 seconds Funding The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry ... Dataset nenets samoyed* Tundra Unknown
spellingShingle	Uralic Samoyedic Nenets Forest Nenets Tundra Nenets endangered language language contact language documentation legacy data INEL AdWHH text corpus speech corpus parallel texts folklore tales narrative elicitation song transcription time-aligned audio morphological glossing part-of-speech borrowings code-switching existantial predication locative predication possessive predication English translation German translation Russian translation EXMARaLDA ELAN XML ISO/TEI Budzisch, Josefina Wagner-Nagy, Beáta INEL Nenets Corpus
title	INEL Nenets Corpus
title_full	INEL Nenets Corpus
title_fullStr	INEL Nenets Corpus
title_full_unstemmed	INEL Nenets Corpus
title_short	INEL Nenets Corpus
title_sort	inel nenets corpus
topic	Uralic Samoyedic Nenets Forest Nenets Tundra Nenets endangered language language contact language documentation legacy data INEL AdWHH text corpus speech corpus parallel texts folklore tales narrative elicitation song transcription time-aligned audio morphological glossing part-of-speech borrowings code-switching existantial predication locative predication possessive predication English translation German translation Russian translation EXMARaLDA ELAN XML ISO/TEI
topic_facet	Uralic Samoyedic Nenets Forest Nenets Tundra Nenets endangered language language contact language documentation legacy data INEL AdWHH text corpus speech corpus parallel texts folklore tales narrative elicitation song transcription time-aligned audio morphological glossing part-of-speech borrowings code-switching existantial predication locative predication possessive predication English translation German translation Russian translation EXMARaLDA ELAN XML ISO/TEI
url	https://www.fdr.uni-hamburg.de/record/16518 https://doi.org/10.25592/uhhfdm.16518

INEL Nenets Corpus

Similar Items