Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects
In this paper we describe the data processing procedures and the preliminary results of the project Ob-Ugric database (OUDB), a web-based framework which aims at developing corpus-based descriptive resources of Khanty and Mansi dialects. Using established language documentation and annotation tools,...
Published in: | Acta Linguistica Academica |
---|---|
Main Authors: | , |
Format: | Article in Journal/Newspaper |
Language: | Hungarian |
Published: |
Akadémiai Kiadó
2017
|
Subjects: | |
Online Access: | http://real.mtak.hu/65044/ http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf https://doi.org/10.1556/2062.2017.64.3.4 |
id |
ftmtak:oai:real.mtak.hu:65044 |
---|---|
record_format |
openpolar |
spelling |
ftmtak:oai:real.mtak.hu:65044 2023-05-15T17:02:37+02:00 Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects Wisiorek, Axel Schön, Zsófia 2017 text http://real.mtak.hu/65044/ http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf https://doi.org/10.1556/2062.2017.64.3.4 hu hun Akadémiai Kiadó http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf Wisiorek, Axel and Schön, Zsófia (2017) Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects. Acta Linguistica Academica, 64 (3). pp. 383-396. ISSN 2559-8201 P0 Philology. Linguistics / filológia nyelvészet Article PeerReviewed info:eu-repo/semantics/article 2017 ftmtak https://doi.org/10.1556/2062.2017.64.3.4 2019-10-02T23:10:49Z In this paper we describe the data processing procedures and the preliminary results of the project Ob-Ugric database (OUDB), a web-based framework which aims at developing corpus-based descriptive resources of Khanty and Mansi dialects. Using established language documentation and annotation tools, OUDB provides interlinked corpus and lexicon data from digitized texts as well as recent fieldwork studies in an uniform IPA-transcription together with the corresponding audio recordings thus making these less described languages of the Ob-Ugric branch of the Finno-Ugric language family accessible for researchers as well as the language community and archiving the raw data for documentation, linguistic evaluation and possible future use in building resources for language technology applications. Article in Journal/Newspaper khanty mansi Mansi MTAK: REAL (Library and Information Centre of the Hungarian Academy of Sciences Acta Linguistica Academica 64 3 383 396 |
institution |
Open Polar |
collection |
MTAK: REAL (Library and Information Centre of the Hungarian Academy of Sciences |
op_collection_id |
ftmtak |
language |
Hungarian |
topic |
P0 Philology. Linguistics / filológia nyelvészet |
spellingShingle |
P0 Philology. Linguistics / filológia nyelvészet Wisiorek, Axel Schön, Zsófia Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects |
topic_facet |
P0 Philology. Linguistics / filológia nyelvészet |
description |
In this paper we describe the data processing procedures and the preliminary results of the project Ob-Ugric database (OUDB), a web-based framework which aims at developing corpus-based descriptive resources of Khanty and Mansi dialects. Using established language documentation and annotation tools, OUDB provides interlinked corpus and lexicon data from digitized texts as well as recent fieldwork studies in an uniform IPA-transcription together with the corresponding audio recordings thus making these less described languages of the Ob-Ugric branch of the Finno-Ugric language family accessible for researchers as well as the language community and archiving the raw data for documentation, linguistic evaluation and possible future use in building resources for language technology applications. |
format |
Article in Journal/Newspaper |
author |
Wisiorek, Axel Schön, Zsófia |
author_facet |
Wisiorek, Axel Schön, Zsófia |
author_sort |
Wisiorek, Axel |
title |
Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects |
title_short |
Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects |
title_full |
Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects |
title_fullStr |
Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects |
title_full_unstemmed |
Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects |
title_sort |
ob-ugric database : corpus and lexicon databases of khanty and mansi dialects |
publisher |
Akadémiai Kiadó |
publishDate |
2017 |
url |
http://real.mtak.hu/65044/ http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf https://doi.org/10.1556/2062.2017.64.3.4 |
genre |
khanty mansi Mansi |
genre_facet |
khanty mansi Mansi |
op_relation |
http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf Wisiorek, Axel and Schön, Zsófia (2017) Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects. Acta Linguistica Academica, 64 (3). pp. 383-396. ISSN 2559-8201 |
op_doi |
https://doi.org/10.1556/2062.2017.64.3.4 |
container_title |
Acta Linguistica Academica |
container_volume |
64 |
container_issue |
3 |
container_start_page |
383 |
op_container_end_page |
396 |
_version_ |
1766056252340699136 |