INEL Selkup Corpus

Corpus Citation Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta. 2020. INEL Selkup Corpus. Version 1.0. Publication date 2020-06-30. Archived in Hamburger Zentrum für Sprachkorpora. http://hdl.handle.net/11022/0000-0007-E1D5-A. In: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferger, Anne; Jettka,...

Bibliographic Details
Main Authors:	Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta
Format:	Dataset
Language:	unknown
Published:	Universität Hamburg 2020
Subjects:	endangered language; indigenous language; L1 data; language contact; language documentation; INEL; folklore; narrative; monologue; annotated; morphological glossing; borrowings; code-switching; semantic roles; syntactic functions; information status; English translation; German translation; Russian translation; Valentina; Chaya; Baikha; Kunina; Selkup
Online Access:	https://dx.doi.org/10.25592/uhhfdm.9753 https://www.fdr.uni-hamburg.de/record/9753

id	ftdatacite:10.25592/uhhfdm.9753
record_format	openpolar
institution	Open Polar
collection	DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id	ftdatacite
language	unknown
topic	endangered language indigenous language L1 data language contact language documentation INEL folklore narrative monologue annotated morphological glossing borrowings code-switching semantic roles syntactic functions information status English translation German translation Russian translation
spellingShingle	endangered language indigenous language L1 data language contact language documentation INEL folklore narrative monologue annotated morphological glossing borrowings code-switching semantic roles syntactic functions information status English translation German translation Russian translation Brykina, Maria Orlova, Svetlana Wagner-Nagy, Beáta INEL Selkup Corpus
topic_facet	endangered language indigenous language L1 data language contact language documentation INEL folklore narrative monologue annotated morphological glossing borrowings code-switching semantic roles syntactic functions information status English translation German translation Russian translation
description	Corpus Citation: Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta. 2020. INEL Selkup Corpus. Version 1.0. Publication date 2020-06-30. Archived in Hamburger Zentrum für Sprachkorpora. http://hdl.handle.net/11022/0000-0007-E1D5-A. In: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lehmberg, Timm (eds.). The INEL corpora of indigenous Northern Eurasian languages. Corpus Description: The INEL Selkup corpus has been created within the long-term INEL project ("Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages"), 2016–2033. The corpus enables typologically aware corpus-based grammatical research on the Selkup language and expands the documentation of the lesser-described indigenous languages of Northern Eurasia. The INEL Selkup corpus is composed of texts from the archive of Angelina Ivanovna Kuzmina (1924–2002), who gathered a large amount of material on Selkup in almost all regions where the Selkup people lived between 1962 and 1977. The archive was transferred by A.I. Kuzmina to Eugen Helimski and acquired by the Universität Hamburg in 2001. Most texts in the corpus originate from the handwritten part of the archive; the others come from sound recordings made by A.I. Kuzmina, transcribed and translated within the INEL project. Funding: The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies’ Programme is coordinated by the Union of the German Academies of Sciences and Humanities. Contributions/Acknowledgements: Audio recordings made by Angelina Kuzmina were transcribed and translated by native speakers of Selkup: Irina Anatolyevna Korobejnikova, written transcription and Russian translation of audio in Central and Southern dialects; Natalya Platonovna Izhenbina, written transcription and Russian translation of audio in Southern dialects; Svetlana Nikitichna Sankevich (Kunina), oral transcription and Russian translation of audio in Northern dialects; Evgeniya Sergeevna Smorgunova (Irikova), oral and written transcription and Russian translation of audio in Northern dialects; Valentina Vladimirovna Tamelkina, oral transcription and Russian translation of audio in Northern dialects. For individual contributions to the collecting, transcribing and analyzing of individual texts, please refer to the user documentation and to the corpus metadata. The web-based search interface uses the Tsakonian Corpus platform developed by Dr. Timofey Arkhangelskiy, Humboldt Research Fellow at IFUU, Hamburg University. New in release 1.0: The corpus now contains 264 texts from 74 speakers, representing the dialects of Middle Taz, Upper Tolka and Baikha (Northern); Narym and Tym (Central); Upper and Middle Ob, Chaya, and Upper and Middle Ket (Southern). These contain 7887 sentences and 42466 words in total. Many texts have been provided with annotations for syntactic functions and semantic roles. Audio transcriptions, glossing and other annotations have been corrected.
format	Dataset
author	Brykina, Maria Orlova, Svetlana Wagner-Nagy, Beáta
author_facet	Brykina, Maria Orlova, Svetlana Wagner-Nagy, Beáta
author_sort	Brykina, Maria
title	INEL Selkup Corpus
title_short	INEL Selkup Corpus
title_full	INEL Selkup Corpus
title_fullStr	INEL Selkup Corpus
title_full_unstemmed	INEL Selkup Corpus
title_sort	inel selkup corpus
publisher	Universität Hamburg
publishDate	2020
url	https://dx.doi.org/10.25592/uhhfdm.9753 https://www.fdr.uni-hamburg.de/record/9753
long_lat	ENVELOPE(161.866,161.866,55.065,55.065) ENVELOPE(135.050,135.050,59.383,59.383) ENVELOPE(87.867,87.867,65.000,65.000) ENVELOPE(67.750,67.750,-71.500,-71.500)
geographic	Valentina Chaya Baikha Kunina
geographic_facet	Valentina Chaya Baikha Kunina
genre	Selkup
genre_facet	Selkup
op_relation	https://dx.doi.org/10.25592/uhhfdm.9721
op_rights	Open Access Creative Commons Attribution Non Commercial Share Alike 4.0 International https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode cc-by-nc-sa-4.0 info:eu-repo/semantics/openAccess
op_rightsnorm	CC-BY-NC-SA
op_doi	https://doi.org/10.25592/uhhfdm.9753 https://doi.org/10.25592/uhhfdm.9721
_version_	1766196155977302016


Similar Items