Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects

In this paper we describe the data processing procedures and the preliminary results of the project Ob-Ugric database (OUDB), a web-based framework which aims at developing corpus-based descriptive resources of Khanty and Mansi dialects. Using established language documentation and annotation tools,...

Full description

Bibliographic Details
Published in:Acta Linguistica Academica
Main Authors: Wisiorek, Axel, Schön, Zsófia
Format: Article in Journal/Newspaper
Language:Hungarian
Published: Akadémiai Kiadó 2017
Subjects:
Online Access:http://real.mtak.hu/65044/
http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf
https://doi.org/10.1556/2062.2017.64.3.4
id ftmtak:oai:real.mtak.hu:65044
record_format openpolar
spelling ftmtak:oai:real.mtak.hu:65044 2023-05-15T17:02:37+02:00 Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects Wisiorek, Axel Schön, Zsófia 2017 text http://real.mtak.hu/65044/ http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf https://doi.org/10.1556/2062.2017.64.3.4 hu hun Akadémiai Kiadó http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf Wisiorek, Axel and Schön, Zsófia (2017) Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects. Acta Linguistica Academica, 64 (3). pp. 383-396. ISSN 2559-8201 P0 Philology. Linguistics / filológia nyelvészet Article PeerReviewed info:eu-repo/semantics/article 2017 ftmtak https://doi.org/10.1556/2062.2017.64.3.4 2019-10-02T23:10:49Z In this paper we describe the data processing procedures and the preliminary results of the project Ob-Ugric database (OUDB), a web-based framework which aims at developing corpus-based descriptive resources of Khanty and Mansi dialects. Using established language documentation and annotation tools, OUDB provides interlinked corpus and lexicon data from digitized texts as well as recent fieldwork studies in an uniform IPA-transcription together with the corresponding audio recordings thus making these less described languages of the Ob-Ugric branch of the Finno-Ugric language family accessible for researchers as well as the language community and archiving the raw data for documentation, linguistic evaluation and possible future use in building resources for language technology applications. Article in Journal/Newspaper khanty mansi Mansi MTAK: REAL (Library and Information Centre of the Hungarian Academy of Sciences Acta Linguistica Academica 64 3 383 396
institution Open Polar
collection MTAK: REAL (Library and Information Centre of the Hungarian Academy of Sciences
op_collection_id ftmtak
language Hungarian
topic P0 Philology. Linguistics / filológia
nyelvészet
spellingShingle P0 Philology. Linguistics / filológia
nyelvészet
Wisiorek, Axel
Schön, Zsófia
Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects
topic_facet P0 Philology. Linguistics / filológia
nyelvészet
description In this paper we describe the data processing procedures and the preliminary results of the project Ob-Ugric database (OUDB), a web-based framework which aims at developing corpus-based descriptive resources of Khanty and Mansi dialects. Using established language documentation and annotation tools, OUDB provides interlinked corpus and lexicon data from digitized texts as well as recent fieldwork studies in an uniform IPA-transcription together with the corresponding audio recordings thus making these less described languages of the Ob-Ugric branch of the Finno-Ugric language family accessible for researchers as well as the language community and archiving the raw data for documentation, linguistic evaluation and possible future use in building resources for language technology applications.
format Article in Journal/Newspaper
author Wisiorek, Axel
Schön, Zsófia
author_facet Wisiorek, Axel
Schön, Zsófia
author_sort Wisiorek, Axel
title Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects
title_short Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects
title_full Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects
title_fullStr Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects
title_full_unstemmed Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects
title_sort ob-ugric database : corpus and lexicon databases of khanty and mansi dialects
publisher Akadémiai Kiadó
publishDate 2017
url http://real.mtak.hu/65044/
http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf
https://doi.org/10.1556/2062.2017.64.3.4
genre khanty
mansi
Mansi
genre_facet khanty
mansi
Mansi
op_relation http://real.mtak.hu/65044/1/2062.2017.64.3.4.pdf
Wisiorek, Axel and Schön, Zsófia (2017) Ob-Ugric database : Corpus and lexicon databases of Khanty and Mansi dialects. Acta Linguistica Academica, 64 (3). pp. 383-396. ISSN 2559-8201
op_doi https://doi.org/10.1556/2062.2017.64.3.4
container_title Acta Linguistica Academica
container_volume 64
container_issue 3
container_start_page 383
op_container_end_page 396
_version_ 1766056252340699136