MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth

MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth This dataset contains 172 microfilm scans Tundra Nenets materials, in which the text content is manually aligned line by line with the scanned images. This material has been created in collaboration between the Finno-Ugrian Society...

Full description

Bibliographic Details
Main Author: Castrén, M. A.
Other Authors: Lukin, Karina, Partanen, Niko, Günter Mühlberger, Günter Hackl
Format: Other/Unknown Material
Language:unknown
Published: Zenodo 2021
Subjects:
HTR
Online Access:https://doi.org/10.5281/zenodo.5759599
id ftzenodo:oai:zenodo.org:5759599
record_format openpolar
spelling ftzenodo:oai:zenodo.org:5759599 2024-09-15T18:19:08+00:00 MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth Castrén, M. A. Lukin, Karina Partanen, Niko Günter Mühlberger Günter Hackl 2021-12-05 https://doi.org/10.5281/zenodo.5759599 yrk unknown Zenodo https://doi.org/10.5281/zenodo.5759586 https://doi.org/10.5281/zenodo.5759599 oai:zenodo.org:5759599 info:eu-repo/semantics/openAccess Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode Ground Truth HTR Tundra Nenets M. A. Castrén Manuscript Linguistics Ethnography info:eu-repo/semantics/other 2021 ftzenodo https://doi.org/10.5281/zenodo.575959910.5281/zenodo.5759586 2024-07-26T06:31:41Z MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth This dataset contains 172 microfilm scans Tundra Nenets materials, in which the text content is manually aligned line by line with the scanned images. This material has been created in collaboration between the Finno-Ugrian Society and the University of Innsbruck. It is intended specifically for handwritten text recognition experiments, training and benchmarking. For electronic materials and printed volumes that are intended to be used in linguistic, ethnographic and folkloric research, please refer to other publications in this Zenodo collection or [Manuscripta Castreaniana website](https://www.sgr.fi/manuscripta/). The materials were aligned in the University of Innsbruck with contributions by Günter Mühlberger and Günter Hackl. Other contributors are Karina Lukin and Niko Partanen. [Transkribus](https://readcoop.eu/transkribus/?sc=Transkribus) platform was extensively used in processing this dataset, and the file format is a direct Transkribus image and Page XML export. Other/Unknown Material nenets samoied* Tundra Zenodo
institution Open Polar
collection Zenodo
op_collection_id ftzenodo
language unknown
topic Ground Truth
HTR
Tundra Nenets
M. A. Castrén
Manuscript
Linguistics
Ethnography
spellingShingle Ground Truth
HTR
Tundra Nenets
M. A. Castrén
Manuscript
Linguistics
Ethnography
Castrén, M. A.
MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth
topic_facet Ground Truth
HTR
Tundra Nenets
M. A. Castrén
Manuscript
Linguistics
Ethnography
description MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth This dataset contains 172 microfilm scans Tundra Nenets materials, in which the text content is manually aligned line by line with the scanned images. This material has been created in collaboration between the Finno-Ugrian Society and the University of Innsbruck. It is intended specifically for handwritten text recognition experiments, training and benchmarking. For electronic materials and printed volumes that are intended to be used in linguistic, ethnographic and folkloric research, please refer to other publications in this Zenodo collection or [Manuscripta Castreaniana website](https://www.sgr.fi/manuscripta/). The materials were aligned in the University of Innsbruck with contributions by Günter Mühlberger and Günter Hackl. Other contributors are Karina Lukin and Niko Partanen. [Transkribus](https://readcoop.eu/transkribus/?sc=Transkribus) platform was extensively used in processing this dataset, and the file format is a direct Transkribus image and Page XML export.
author2 Lukin, Karina
Partanen, Niko
Günter Mühlberger
Günter Hackl
format Other/Unknown Material
author Castrén, M. A.
author_facet Castrén, M. A.
author_sort Castrén, M. A.
title MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth
title_short MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth
title_full MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth
title_fullStr MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth
title_full_unstemmed MC VIII SAMOIEDICA 2: JURAK-SAMOIEDICA 1: Line-aligned Ground Truth
title_sort mc viii samoiedica 2: jurak-samoiedica 1: line-aligned ground truth
publisher Zenodo
publishDate 2021
url https://doi.org/10.5281/zenodo.5759599
genre nenets
samoied*
Tundra
genre_facet nenets
samoied*
Tundra
op_relation https://doi.org/10.5281/zenodo.5759586
https://doi.org/10.5281/zenodo.5759599
oai:zenodo.org:5759599
op_rights info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
op_doi https://doi.org/10.5281/zenodo.575959910.5281/zenodo.5759586
_version_ 1810457219725000704