INEL Evenki Corpus

Corpus Citation Däbritz, Chris Lasse & Gusev, Valentin. 2021. INEL Evenki Corpus. Version 1.0. Publication date 2021-12-31. Archived at Universität Hamburg. https://hdl.handle.net/11022/0000-0007-F43C-3. In: The INEL corpora of indigenous Northern Eurasian languages. https://hdl.handle.net/110...

Bibliographic Details
Main Authors: Däbritz, Chris Lasse; Gusev, Valentin
Other Authors: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lazarenko, Elena; Lehmberg, Timm; Riaposov, Aleksandr
Format: Dataset
Language: unknown
Published: 2021
Subjects:
Online Access: https://doi.org/10.25592/uhhfdm.9628
Description
Summary:

Corpus Citation
Däbritz, Chris Lasse & Gusev, Valentin. 2021. INEL Evenki Corpus. Version 1.0. Publication date 2021-12-31. Archived at Universität Hamburg. https://hdl.handle.net/11022/0000-0007-F43C-3. In: The INEL corpora of indigenous Northern Eurasian languages. https://hdl.handle.net/11022/0000-0007-F45A-1

Corpus Description
The INEL Evenki Corpus has been created within the long-term INEL project (Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages), 2016–2033. The corpus enables typologically aware, corpus-based grammatical research on the Evenki (< Tungusic) language and expands the documentation of the lesser-described indigenous languages of Northern Eurasia.

The INEL Evenki Corpus covers Northern (Taimyr, Khantayskoe Ozero, Ilimpi, Erbogachon) and Southern (Sym) Evenki dialects, which have or had contact with other languages dealt with in the INEL project, first and foremost Dolgan and Selkup.

The INEL Evenki Corpus is composed of texts from different sources:
Published texts from different text collections, inter alia "Sbornik materialov po evenkijskomu (tungusskomu) fol'kloru" (Vasilevich 1936), covering all named dialects.
Transcripts of recordings obtained from the Taimyr House of National Arts (TDNT) in Dudinka (2000s), as well as transcripts of recordings made by and from Tat’yana V. Bolina, both representing the Khantayskoe Ozero dialect.
Texts from the handwritten archive of the Russian ethnographer and linguist Konstantin M. Rychkov, recorded in the 1900s/1910s, covering the Taimyr, Ilimpi and Sym dialects.

Each text in the corpus is provided with morphological glossing, translations into English, Russian and German, and annotation of Russian borrowings. Some texts also have annotations for syntactic functions, semantic roles and information status.

Funding
The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ ...