The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort

Languages employ different strategies to transmit structural and grammatical information. While, for example, grammatical dependency relationships in sentences are mainly conveyed by the ordering of the words for languages like Mandarin Chinese, or Vietnamese, the word ordering is much less restrict...

Full description

Bibliographic Details
Published in:PLOS ONE
Main Authors: Koplenig, Alexander, Meyer, Peter, Wolfer, Sascha, Müller-Spitzer, Carolin
Format: Article in Journal/Newspaper
Language:English
Published: 2017
Subjects:
Online Access:https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5971
https://nbn-resolving.org/urn:nbn:de:bsz:mh39-59714
https://doi.org/10.1371/journal.pone.0173614
https://ids-pub.bsz-bw.de/files/5971/Koplenig_Meyer_Wolfer_Mueller_Spitzer_Statistical_trade_2017.pdf
id ftinstdeusprache:oai:ids-pub.bsz-bw.de:5971
record_format openpolar
spelling ftinstdeusprache:oai:ids-pub.bsz-bw.de:5971 2023-06-11T04:13:19+02:00 The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort Koplenig, Alexander Meyer, Peter Wolfer, Sascha Müller-Spitzer, Carolin 2017-03-13 application/pdf https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5971 https://nbn-resolving.org/urn:nbn:de:bsz:mh39-59714 https://doi.org/10.1371/journal.pone.0173614 https://ids-pub.bsz-bw.de/files/5971/Koplenig_Meyer_Wolfer_Mueller_Spitzer_Statistical_trade_2017.pdf eng eng https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5971 urn:nbn:de:bsz:mh39-59714 https://nbn-resolving.org/urn:nbn:de:bsz:mh39-59714 https://doi.org/10.1371/journal.pone.0173614 https://ids-pub.bsz-bw.de/files/5971/Koplenig_Meyer_Wolfer_Mueller_Spitzer_Statistical_trade_2017.pdf https://creativecommons.org/licenses/by/4.0/ info:eu-repo/semantics/openAccess Sprachstatistik Wortstellung Informationsstruktur ddc:400 article doc-type:article 2017 ftinstdeusprache https://doi.org/10.1371/journal.pone.0173614 2023-05-07T23:06:31Z Languages employ different strategies to transmit structural and grammatical information. While, for example, grammatical dependency relationships in sentences are mainly conveyed by the ordering of the words for languages like Mandarin Chinese, or Vietnamese, the word ordering is much less restricted for languages such as Inupiatun or Quechua, as these languages (also) use the internal structure of words (e.g. inflectional morphology) to mark grammatical relationships in a sentence. Based on a quantitative analysis of more than 1,500 unique translations of different books of the Bible in almost 1,200 different languages that are spoken as a native language by approximately 6 billion people (more than 80% of the world population), we present large-scale evidence for a statistical trade-off between the amount of information conveyed by the ordering of words and the amount of information conveyed by internal word structure: languages that rely more strongly on word order information tend to rely less on word structure information and vice versa. Or put differently, if less information is carried within the word, more information has to be spread among words in order to communicate successfully. In addition, we find that–despite differences in the way information is expressed–there is also evidence for a trade-off between different books of the biblical canon that recurs with little variation across languages: the more informative the word order of the book, the less informative its word structure and vice versa. We argue that this might suggest that, on the one hand, languages encode information in very different (but efficient) ways. On the other hand, content-related and stylistic features are statistically encoded in very similar ways. Article in Journal/Newspaper Inupiatun Institut für Deutsche Sprache: Publikationsserver PLOS ONE 12 3 e0173614
institution Open Polar
collection Institut für Deutsche Sprache: Publikationsserver
op_collection_id ftinstdeusprache
language English
topic Sprachstatistik
Wortstellung
Informationsstruktur
ddc:400
spellingShingle Sprachstatistik
Wortstellung
Informationsstruktur
ddc:400
Koplenig, Alexander
Meyer, Peter
Wolfer, Sascha
Müller-Spitzer, Carolin
The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort
topic_facet Sprachstatistik
Wortstellung
Informationsstruktur
ddc:400
description Languages employ different strategies to transmit structural and grammatical information. While, for example, grammatical dependency relationships in sentences are mainly conveyed by the ordering of the words for languages like Mandarin Chinese, or Vietnamese, the word ordering is much less restricted for languages such as Inupiatun or Quechua, as these languages (also) use the internal structure of words (e.g. inflectional morphology) to mark grammatical relationships in a sentence. Based on a quantitative analysis of more than 1,500 unique translations of different books of the Bible in almost 1,200 different languages that are spoken as a native language by approximately 6 billion people (more than 80% of the world population), we present large-scale evidence for a statistical trade-off between the amount of information conveyed by the ordering of words and the amount of information conveyed by internal word structure: languages that rely more strongly on word order information tend to rely less on word structure information and vice versa. Or put differently, if less information is carried within the word, more information has to be spread among words in order to communicate successfully. In addition, we find that–despite differences in the way information is expressed–there is also evidence for a trade-off between different books of the biblical canon that recurs with little variation across languages: the more informative the word order of the book, the less informative its word structure and vice versa. We argue that this might suggest that, on the one hand, languages encode information in very different (but efficient) ways. On the other hand, content-related and stylistic features are statistically encoded in very similar ways.
format Article in Journal/Newspaper
author Koplenig, Alexander
Meyer, Peter
Wolfer, Sascha
Müller-Spitzer, Carolin
author_facet Koplenig, Alexander
Meyer, Peter
Wolfer, Sascha
Müller-Spitzer, Carolin
author_sort Koplenig, Alexander
title The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort
title_short The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort
title_full The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort
title_fullStr The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort
title_full_unstemmed The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort
title_sort statistical trade-off between word order and word structure – large-scale evidence for the principle of least effort
publishDate 2017
url https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5971
https://nbn-resolving.org/urn:nbn:de:bsz:mh39-59714
https://doi.org/10.1371/journal.pone.0173614
https://ids-pub.bsz-bw.de/files/5971/Koplenig_Meyer_Wolfer_Mueller_Spitzer_Statistical_trade_2017.pdf
genre Inupiatun
genre_facet Inupiatun
op_relation https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5971
urn:nbn:de:bsz:mh39-59714
https://nbn-resolving.org/urn:nbn:de:bsz:mh39-59714
https://doi.org/10.1371/journal.pone.0173614
https://ids-pub.bsz-bw.de/files/5971/Koplenig_Meyer_Wolfer_Mueller_Spitzer_Statistical_trade_2017.pdf
op_rights https://creativecommons.org/licenses/by/4.0/
info:eu-repo/semantics/openAccess
op_doi https://doi.org/10.1371/journal.pone.0173614
container_title PLOS ONE
container_volume 12
container_issue 3
container_start_page e0173614
_version_ 1768390184281309184