INEL Dolgan Corpus

Corpus Citation Däbritz, Chris Lasse; Kudryakova, Nina; Stapert, Eugénie. 2019. "INEL Dolgan Corpus." Version 1.0. Publication date 2019-08-31. https://hdl.handle.net/11022/0000-0007-CAE7-1. Archived in Hamburger Zentrum für Sprachkorpora. In: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferge...

Bibliographic Details
Main Authors: Däbritz, Chris Lasse, Kudryakova, Nina, Stapert, Eugénie
Other Authors: Wagner-Nagy, Beáta, Arhipov, Aleksandre
Format: Dataset
Language: unknown
Published: 2019
Subjects:
Online Access: https://doi.org/10.25592/uhhfdm.9747
id ftunihamburgdata:oai:fdr.uni-hamburg.de:9747
record_format openpolar
spelling ftunihamburgdata:oai:fdr.uni-hamburg.de:9747 2023-05-15T16:02:15+02:00 INEL Dolgan Corpus Däbritz, Chris Lasse Kudryakova, Nina Stapert, Eugénie Wagner-Nagy, Beáta Arhipov, Aleksandre Däbritz, Chris Lasse Kudryakova, Nina Stapert, Eugénie 2019-08-31 https://doi.org/10.25592/uhhfdm.9747 unknown handle:11022/0000-0007-CAE7-1 doi:10.25592/uhhfdm.9746 doi:10.25592/uhhfdm.9747 oai:fdr.uni-hamburg.de:9747 info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode CC-BY-NC-SA endangered language indigenous language L1 data language contact language documentation INEL folklore narrative monologue annotated morphological glossing borrowings code-switching semantic roles syntactic functions information status English translation German translation Russian translation info:eu-repo/semantics/other dataset 2019 ftunihamburgdata https://doi.org/10.25592/uhhfdm.9747 https://doi.org/10.25592/uhhfdm.9746 2022-12-04T23:10:57Z Corpus Citation Däbritz, Chris Lasse; Kudryakova, Nina; Stapert, Eugénie. 2019. "INEL Dolgan Corpus." Version 1.0. Publication date 2019-08-31. https://hdl.handle.net/11022/0000-0007-CAE7-1. Archived in Hamburger Zentrum für Sprachkorpora. In: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lehmberg, Timm (eds.). The INEL corpora of indigenous Northern Eurasian languages. Corpus Description The INEL Dolgan corpus was created within the long-term INEL project ("Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages"), 2016–2033. The corpus enables typologically aware, corpus-based grammatical research on the Dolgan language and expands the documentation of the lesser-described indigenous languages of Northern Eurasia. The INEL Dolgan corpus is composed of texts from different sources: 1. Published folklore texts from an edited volume ("Fol'klor Dolgan", P.E. Efremov 2000), 2. Transcripts of recordings obtained from the Taymyr House of Folk Art (TDNT) in Dudinka (1970s–2000s), 3. Transcripts from the collection of Dr. Eugénie Stapert recorded on several fieldwork trips in 2007–2010, 4. Transcripts of recordings made on a fieldwork trip in 2017. The first group and parts of the third group were already transcribed and translated; the remaining recordings were transcribed and translated within the INEL project. Each text in the corpus has morphological glossing, translations into English, Russian and German, and annotation of Russian borrowings. Some texts also have annotations for syntactic functions, semantic roles and information structure/information status. Funding The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies’ Programme is coordinated by the Union of the German Academies of ... Dataset Dolgan language Taymyr ZFDM Repository (Center for Sustainable Research Data Management, Hamburg University) Dudinka ENVELOPE(86.183,86.183,69.394,69.394) Taymyr ENVELOPE(89.987,89.987,68.219,68.219)
institution Open Polar
collection ZFDM Repository (Center for Sustainable Research Data Management, Hamburg University)
op_collection_id ftunihamburgdata
language unknown
topic endangered language
indigenous language
L1 data
language contact
language documentation
INEL
folklore
narrative
monologue
annotated
morphological glossing
borrowings
code-switching
semantic roles
syntactic functions
information status
English translation
German translation
Russian translation
spellingShingle endangered language
indigenous language
L1 data
language contact
language documentation
INEL
folklore
narrative
monologue
annotated
morphological glossing
borrowings
code-switching
semantic roles
syntactic functions
information status
English translation
German translation
Russian translation
Däbritz, Chris Lasse
Kudryakova, Nina
Stapert, Eugénie
INEL Dolgan Corpus
topic_facet endangered language
indigenous language
L1 data
language contact
language documentation
INEL
folklore
narrative
monologue
annotated
morphological glossing
borrowings
code-switching
semantic roles
syntactic functions
information status
English translation
German translation
Russian translation
description Corpus Citation Däbritz, Chris Lasse; Kudryakova, Nina; Stapert, Eugénie. 2019. "INEL Dolgan Corpus." Version 1.0. Publication date 2019-08-31. https://hdl.handle.net/11022/0000-0007-CAE7-1. Archived in Hamburger Zentrum für Sprachkorpora. In: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lehmberg, Timm (eds.). The INEL corpora of indigenous Northern Eurasian languages. Corpus Description The INEL Dolgan corpus was created within the long-term INEL project ("Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages"), 2016–2033. The corpus enables typologically aware, corpus-based grammatical research on the Dolgan language and expands the documentation of the lesser-described indigenous languages of Northern Eurasia. The INEL Dolgan corpus is composed of texts from different sources: 1. Published folklore texts from an edited volume ("Fol'klor Dolgan", P.E. Efremov 2000), 2. Transcripts of recordings obtained from the Taymyr House of Folk Art (TDNT) in Dudinka (1970s–2000s), 3. Transcripts from the collection of Dr. Eugénie Stapert recorded on several fieldwork trips in 2007–2010, 4. Transcripts of recordings made on a fieldwork trip in 2017. The first group and parts of the third group were already transcribed and translated; the remaining recordings were transcribed and translated within the INEL project. Each text in the corpus has morphological glossing, translations into English, Russian and German, and annotation of Russian borrowings. Some texts also have annotations for syntactic functions, semantic roles and information structure/information status. Funding The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies’ Programme is coordinated by the Union of the German Academies of ...
author2 Wagner-Nagy, Beáta
Arhipov, Aleksandre
Däbritz, Chris Lasse
Kudryakova, Nina
Stapert, Eugénie
format Dataset
author Däbritz, Chris Lasse
Kudryakova, Nina
Stapert, Eugénie
author_facet Däbritz, Chris Lasse
Kudryakova, Nina
Stapert, Eugénie
author_sort Däbritz, Chris Lasse
title INEL Dolgan Corpus
title_short INEL Dolgan Corpus
title_full INEL Dolgan Corpus
title_fullStr INEL Dolgan Corpus
title_full_unstemmed INEL Dolgan Corpus
title_sort inel dolgan corpus
publishDate 2019
url https://doi.org/10.25592/uhhfdm.9747
long_lat ENVELOPE(86.183,86.183,69.394,69.394)
ENVELOPE(89.987,89.987,68.219,68.219)
geographic Dudinka
Taymyr
geographic_facet Dudinka
Taymyr
genre Dolgan language
Taymyr
genre_facet Dolgan language
Taymyr
op_relation handle:11022/0000-0007-CAE7-1
doi:10.25592/uhhfdm.9746
doi:10.25592/uhhfdm.9747
oai:fdr.uni-hamburg.de:9747
op_rights info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode
op_rightsnorm CC-BY-NC-SA
op_doi https://doi.org/10.25592/uhhfdm.9747
https://doi.org/10.25592/uhhfdm.9746
_version_ 1766397826755985408