A Bayesian approach to the classification of Tungusic languages

The Tungusic language family is comprised of languages spoken in Siberia, the Russian Far East, Northeast China and Xinjiang. There is a general consensus that these languages are genealogically related and descend from a common ancestral language. Nevertheless, there is considerable disagreement wi...

Full description

Bibliographic Details
Published in:Diachronica
Main Authors: Oskolskaya, S., Koile, E., Robbeets, M.
Format: Article in Journal/Newspaper
Language:English
Published: 2021
Subjects:
Online Access:http://hdl.handle.net/21.11116/0000-0009-A4D4-E
id ftpubman:oai:pure.mpg.de:item_3358460
record_format openpolar
spelling ftpubman:oai:pure.mpg.de:item_3358460 2023-08-27T04:12:25+02:00 A Bayesian approach to the classification of Tungusic languages Oskolskaya, S. Koile, E. Robbeets, M. 2021-05-25 http://hdl.handle.net/21.11116/0000-0009-A4D4-E eng eng info:eu-repo/semantics/altIdentifier/doi/10.1075/dia.20010.osk http://hdl.handle.net/21.11116/0000-0009-A4D4-E Diachronica info:eu-repo/semantics/article 2021 ftpubman https://doi.org/10.1075/dia.20010.osk 2023-08-02T01:25:42Z The Tungusic language family is comprised of languages spoken in Siberia, the Russian Far East, Northeast China and Xinjiang. There is a general consensus that these languages are genealogically related and descend from a common ancestral language. Nevertheless, there is considerable disagreement with regard to the internal structure of the Tungusic family and the time depth of its separation into daughter languages. Here we use computational Bayesian phylogenetic methods to generate a phylogeny of Tungusic languages and estimate the time-depth of the family. Our analysis is based on the recently introduced Leipzig-Jakarta-Jena list, a dataset of 254 basic vocabulary items collected for 21 Tungusic doculects. Our results are consistent with two basic classifications previously proposed in the literature, notably a Manchu-Tungusic classification, in which the break-up of Jurchenic constitutes the first split in the tree, as well as a North-South classification, which includes a Jurchenic-Nanaic and an Orochic-Ewenic branch. In addition, we obtain a time-depth for the age of Proto-Tungusic between the 8th century BC and the 12th century AD (95% highest posterior density interval). Previous classifications of Tungusic were based on both classical historical comparative linguistic and lexicostatistic approaches, but the application of Bayesian phylogenetic methods to the Tungusic languages has not so far been attempted. In contrast to previous approaches, our Bayesian analysis adds an understanding of the statistical robustness of the proposed branches and infers absolute divergence dates, allowing variation of rates of change across branches and cognate sets. In this way, our research provides a reliable quantitative basis for previous estimates based on classical historical linguistic and lexicostatistic approaches. Article in Journal/Newspaper Tungusic languages Siberia Max Planck Society: MPG.PuRe Diachronica
institution Open Polar
collection Max Planck Society: MPG.PuRe
op_collection_id ftpubman
language English
description The Tungusic language family is comprised of languages spoken in Siberia, the Russian Far East, Northeast China and Xinjiang. There is a general consensus that these languages are genealogically related and descend from a common ancestral language. Nevertheless, there is considerable disagreement with regard to the internal structure of the Tungusic family and the time depth of its separation into daughter languages. Here we use computational Bayesian phylogenetic methods to generate a phylogeny of Tungusic languages and estimate the time-depth of the family. Our analysis is based on the recently introduced Leipzig-Jakarta-Jena list, a dataset of 254 basic vocabulary items collected for 21 Tungusic doculects. Our results are consistent with two basic classifications previously proposed in the literature, notably a Manchu-Tungusic classification, in which the break-up of Jurchenic constitutes the first split in the tree, as well as a North-South classification, which includes a Jurchenic-Nanaic and an Orochic-Ewenic branch. In addition, we obtain a time-depth for the age of Proto-Tungusic between the 8th century BC and the 12th century AD (95% highest posterior density interval). Previous classifications of Tungusic were based on both classical historical comparative linguistic and lexicostatistic approaches, but the application of Bayesian phylogenetic methods to the Tungusic languages has not so far been attempted. In contrast to previous approaches, our Bayesian analysis adds an understanding of the statistical robustness of the proposed branches and infers absolute divergence dates, allowing variation of rates of change across branches and cognate sets. In this way, our research provides a reliable quantitative basis for previous estimates based on classical historical linguistic and lexicostatistic approaches.
format Article in Journal/Newspaper
author Oskolskaya, S.
Koile, E.
Robbeets, M.
spellingShingle Oskolskaya, S.
Koile, E.
Robbeets, M.
A Bayesian approach to the classification of Tungusic languages
author_facet Oskolskaya, S.
Koile, E.
Robbeets, M.
author_sort Oskolskaya, S.
title A Bayesian approach to the classification of Tungusic languages
title_short A Bayesian approach to the classification of Tungusic languages
title_full A Bayesian approach to the classification of Tungusic languages
title_fullStr A Bayesian approach to the classification of Tungusic languages
title_full_unstemmed A Bayesian approach to the classification of Tungusic languages
title_sort bayesian approach to the classification of tungusic languages
publishDate 2021
url http://hdl.handle.net/21.11116/0000-0009-A4D4-E
genre Tungusic languages
Siberia
genre_facet Tungusic languages
Siberia
op_source Diachronica
op_relation info:eu-repo/semantics/altIdentifier/doi/10.1075/dia.20010.osk
http://hdl.handle.net/21.11116/0000-0009-A4D4-E
op_doi https://doi.org/10.1075/dia.20010.osk
container_title Diachronica
_version_ 1775356537184714752