MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth
MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth This dataset contains 172 microfilm scans Tundra Nenets materials, in which the text content is manually aligned line by line with the scanned images. This material has been created in collaboration between the Finno-Ugrian Society...
Main Author: | |
---|---|
Format: | Dataset |
Language: | unknown |
Published: |
Zenodo
2021
|
Subjects: | |
Online Access: | https://dx.doi.org/10.5281/zenodo.5759586 https://zenodo.org/record/5759586 |
id |
ftdatacite:10.5281/zenodo.5759586 |
---|---|
record_format |
openpolar |
spelling |
ftdatacite:10.5281/zenodo.5759586 2023-05-15T17:14:31+02:00 MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth Castrén, M. A. 2021 https://dx.doi.org/10.5281/zenodo.5759586 https://zenodo.org/record/5759586 unknown Zenodo https://dx.doi.org/10.5281/zenodo.5759587 https://dx.doi.org/10.5281/zenodo.5759599 Open Access Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode cc-by-4.0 info:eu-repo/semantics/openAccess CC-BY Ground Truth HTR Tundra Nenets M. A. Castrén Manuscript Linguistics FOS Languages and literature Ethnography dataset Dataset 2021 ftdatacite https://doi.org/10.5281/zenodo.5759586 https://doi.org/10.5281/zenodo.5759587 https://doi.org/10.5281/zenodo.5759599 2022-02-08T15:32:11Z MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth This dataset contains 172 microfilm scans Tundra Nenets materials, in which the text content is manually aligned line by line with the scanned images. This material has been created in collaboration between the Finno-Ugrian Society and the University of Innsbruck. It is intended specifically for handwritten text recognition experiments, training and benchmarking. For electronic materials and printed volumes that are intended to be used in linguistic, ethnographic and folkloric research, please refer to other publications in this Zenodo collection or [Manuscripta Castreaniana website](https://www.sgr.fi/manuscripta/). The materials were aligned in the University of Innsbruck with contributions by Günter Mühlberger and Günter Hackl. Other contributors are Karina Lukin and Niko Partanen. [Transkribus](https://readcoop.eu/transkribus/?sc=Transkribus) platform was extensively used in processing this dataset, and the file format is a direct Transkribus image and Page XML export. Dataset nenets samoied* Tundra DataCite Metadata Store (German National Library of Science and Technology) |
institution |
Open Polar |
collection |
DataCite Metadata Store (German National Library of Science and Technology) |
op_collection_id |
ftdatacite |
language |
unknown |
topic |
Ground Truth HTR Tundra Nenets M. A. Castrén Manuscript Linguistics FOS Languages and literature Ethnography |
spellingShingle |
Ground Truth HTR Tundra Nenets M. A. Castrén Manuscript Linguistics FOS Languages and literature Ethnography Castrén, M. A. MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth |
topic_facet |
Ground Truth HTR Tundra Nenets M. A. Castrén Manuscript Linguistics FOS Languages and literature Ethnography |
description |
MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth This dataset contains 172 microfilm scans Tundra Nenets materials, in which the text content is manually aligned line by line with the scanned images. This material has been created in collaboration between the Finno-Ugrian Society and the University of Innsbruck. It is intended specifically for handwritten text recognition experiments, training and benchmarking. For electronic materials and printed volumes that are intended to be used in linguistic, ethnographic and folkloric research, please refer to other publications in this Zenodo collection or [Manuscripta Castreaniana website](https://www.sgr.fi/manuscripta/). The materials were aligned in the University of Innsbruck with contributions by Günter Mühlberger and Günter Hackl. Other contributors are Karina Lukin and Niko Partanen. [Transkribus](https://readcoop.eu/transkribus/?sc=Transkribus) platform was extensively used in processing this dataset, and the file format is a direct Transkribus image and Page XML export. |
format |
Dataset |
author |
Castrén, M. A. |
author_facet |
Castrén, M. A. |
author_sort |
Castrén, M. A. |
title |
MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth |
title_short |
MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth |
title_full |
MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth |
title_fullStr |
MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth |
title_full_unstemmed |
MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth |
title_sort |
mc viii samoiedica 2: jurak-samoiedica 1: line-aligned ground truth |
publisher |
Zenodo |
publishDate |
2021 |
url |
https://dx.doi.org/10.5281/zenodo.5759586 https://zenodo.org/record/5759586 |
genre |
nenets samoied* Tundra |
genre_facet |
nenets samoied* Tundra |
op_relation |
https://dx.doi.org/10.5281/zenodo.5759587 https://dx.doi.org/10.5281/zenodo.5759599 |
op_rights |
Open Access Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode cc-by-4.0 info:eu-repo/semantics/openAccess |
op_rightsnorm |
CC-BY |
op_doi |
https://doi.org/10.5281/zenodo.5759586 https://doi.org/10.5281/zenodo.5759587 https://doi.org/10.5281/zenodo.5759599 |
_version_ |
1766071898821623808 |