Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank

Source at http://e-scripta.ilit.bas.bg/archives/year-2015/issue-14-15 . Journal home page at http://e-scripta.ilit.bas.bg/ . The Tromsø Old Russian and OCS Treebank (TOROT, nestor.uit.no)1 is, along with its parent treebank, the PROIEL corpus (foni.uio.no), the only existing treebank of Old Church S...

Full description

Bibliographic Details
Main Authors: Eckhoff, Hanne Martine, Berdicevskis, Aleksandrs
Format: Article in Journal/Newspaper
Language:English
Published: Institute for Literature, Bulgarian Academy of Sciences 2015
Subjects:
Online Access:https://hdl.handle.net/10037/22366
id ftunivtroemsoe:oai:munin.uit.no:10037/22366
record_format openpolar
spelling ftunivtroemsoe:oai:munin.uit.no:10037/22366 2023-05-15T18:34:23+02:00 Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank Eckhoff, Hanne Martine Berdicevskis, Aleksandrs 2015 https://hdl.handle.net/10037/22366 eng eng Institute for Literature, Bulgarian Academy of Sciences Scripta & e-Scripta info:eu-repo/grantAgreement/RCN/FRIHUMSAM/222506/Norway/Birds & Beasts: Shaping Events in Old Russian// Eckhoff HM, Berdicevskis A. Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank. Scripta & e-Scripta. 2015;14-15:9-25 FRIDAID 1266416 1312-238X https://hdl.handle.net/10037/22366 openAccess Copyright 2015 The Author(s) VDP::Humanities: 000::Linguistics: 010::Russian language: 028 VDP::Humaniora: 000::Språkvitenskapelige fag: 010::Russisk språk: 028 VDP::Humanities: 000::Linguistics: 010::Other Slavic languages: 029 VDP::Humaniora: 000::Språkvitenskapelige fag: 010::Andre slaviske språk: 029 Journal article Tidsskriftartikkel Peer reviewed publishedVersion 2015 ftunivtroemsoe 2021-09-08T22:53:43Z Source at http://e-scripta.ilit.bas.bg/archives/year-2015/issue-14-15 . Journal home page at http://e-scripta.ilit.bas.bg/ . The Tromsø Old Russian and OCS Treebank (TOROT, nestor.uit.no)1 is, along with its parent treebank, the PROIEL corpus (foni.uio.no), the only existing treebank of Old Church Slavonic (OCS), Old East Slavic and Middle Russian texts. There are other tagged resources, such as the Old Russian subcorpus of the Russian National Corpus2 and the Manuskript corpus,3 but none of them, to our knowledge, currently provide syntactic annotation. The TOROT presently contains approximately 160,000 word tokens of fully annotated OCS (Codex Marianus4 and Codex Suprasliensis), 85,000 word tokens of fully annotated Kiev-era Old East Slavic, and 60,000 word tokens of fully annotated 15th–17th-century Middle Russian. In addition, it contains the Codex Zographensis with automatic and partially hand-corrected morphological annotation and lemmatisation (sections of the Gospels missing in the Codex Marianus also have full syntactic annotation), and the PROIEL version of the Greek Gospels, with which the Codex Marianus and the Codex Zographensis are both aligned at token level (automatically, then hand-corrected). Article in Journal/Newspaper Tromsø University of Tromsø: Munin Open Research Archive Tromsø
institution Open Polar
collection University of Tromsø: Munin Open Research Archive
op_collection_id ftunivtroemsoe
language English
topic VDP::Humanities: 000::Linguistics: 010::Russian language: 028
VDP::Humaniora: 000::Språkvitenskapelige fag: 010::Russisk språk: 028
VDP::Humanities: 000::Linguistics: 010::Other Slavic languages: 029
VDP::Humaniora: 000::Språkvitenskapelige fag: 010::Andre slaviske språk: 029
spellingShingle VDP::Humanities: 000::Linguistics: 010::Russian language: 028
VDP::Humaniora: 000::Språkvitenskapelige fag: 010::Russisk språk: 028
VDP::Humanities: 000::Linguistics: 010::Other Slavic languages: 029
VDP::Humaniora: 000::Språkvitenskapelige fag: 010::Andre slaviske språk: 029
Eckhoff, Hanne Martine
Berdicevskis, Aleksandrs
Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank
topic_facet VDP::Humanities: 000::Linguistics: 010::Russian language: 028
VDP::Humaniora: 000::Språkvitenskapelige fag: 010::Russisk språk: 028
VDP::Humanities: 000::Linguistics: 010::Other Slavic languages: 029
VDP::Humaniora: 000::Språkvitenskapelige fag: 010::Andre slaviske språk: 029
description Source at http://e-scripta.ilit.bas.bg/archives/year-2015/issue-14-15 . Journal home page at http://e-scripta.ilit.bas.bg/ . The Tromsø Old Russian and OCS Treebank (TOROT, nestor.uit.no)1 is, along with its parent treebank, the PROIEL corpus (foni.uio.no), the only existing treebank of Old Church Slavonic (OCS), Old East Slavic and Middle Russian texts. There are other tagged resources, such as the Old Russian subcorpus of the Russian National Corpus2 and the Manuskript corpus,3 but none of them, to our knowledge, currently provide syntactic annotation. The TOROT presently contains approximately 160,000 word tokens of fully annotated OCS (Codex Marianus4 and Codex Suprasliensis), 85,000 word tokens of fully annotated Kiev-era Old East Slavic, and 60,000 word tokens of fully annotated 15th–17th-century Middle Russian. In addition, it contains the Codex Zographensis with automatic and partially hand-corrected morphological annotation and lemmatisation (sections of the Gospels missing in the Codex Marianus also have full syntactic annotation), and the PROIEL version of the Greek Gospels, with which the Codex Marianus and the Codex Zographensis are both aligned at token level (automatically, then hand-corrected).
format Article in Journal/Newspaper
author Eckhoff, Hanne Martine
Berdicevskis, Aleksandrs
author_facet Eckhoff, Hanne Martine
Berdicevskis, Aleksandrs
author_sort Eckhoff, Hanne Martine
title Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank
title_short Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank
title_full Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank
title_fullStr Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank
title_full_unstemmed Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank
title_sort linguistics vs. digital editions: the tromsø old russian and ocs treebank
publisher Institute for Literature, Bulgarian Academy of Sciences
publishDate 2015
url https://hdl.handle.net/10037/22366
geographic Tromsø
geographic_facet Tromsø
genre Tromsø
genre_facet Tromsø
op_relation Scripta & e-Scripta
info:eu-repo/grantAgreement/RCN/FRIHUMSAM/222506/Norway/Birds & Beasts: Shaping Events in Old Russian//
Eckhoff HM, Berdicevskis A. Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank. Scripta & e-Scripta. 2015;14-15:9-25
FRIDAID 1266416
1312-238X
https://hdl.handle.net/10037/22366
op_rights openAccess
Copyright 2015 The Author(s)
_version_ 1766219107737272320