The Karjala database – challenges and solutions for digitizing heterogeneous, old genealogical documents for internet use

International audience The Karjala database contains digitized demographic data of the parish registers from the regions ceded to the Soviet Union in 1944. The objectives of the digitization project have been to promote access to digitized records for scientific research and genealogy as well as enc...

Full description

Bibliographic Details
Main Authors: Saarti, Jarmo, Ropponen, Jari, Soivanen, Satu
Other Authors: University of Eastern Finland, DARIAH
Format: Conference Object
Language:English
Published: HAL CCSD 2017
Subjects:
Online Access:https://inria.hal.science/hal-01660143
https://inria.hal.science/hal-01660143/document
https://inria.hal.science/hal-01660143/file/331967.pdf
id ftccsdartic:oai:HAL:hal-01660143v1
record_format openpolar
spelling ftccsdartic:oai:HAL:hal-01660143v1 2023-06-18T03:41:31+02:00 The Karjala database – challenges and solutions for digitizing heterogeneous, old genealogical documents for internet use Saarti, Jarmo Ropponen, Jari Soivanen, Satu University of Eastern Finland DARIAH Berlin, Germany 2017-08-15 https://inria.hal.science/hal-01660143 https://inria.hal.science/hal-01660143/document https://inria.hal.science/hal-01660143/file/331967.pdf en eng HAL CCSD hal-01660143 https://inria.hal.science/hal-01660143 https://inria.hal.science/hal-01660143/document https://inria.hal.science/hal-01660143/file/331967.pdf http://creativecommons.org/licenses/by/ info:eu-repo/semantics/OpenAccess DH. Opportunities and Risks. Connecting Libraries and Research https://inria.hal.science/hal-01660143 DH. Opportunities and Risks. Connecting Libraries and Research, Aug 2017, Berlin, Germany https://dh-libraries.sciencesconf.org digitization genealogical documents handwriting Karelia Finland [INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL] info:eu-repo/semantics/conferenceObject Conference papers 2017 ftccsdartic 2023-06-04T05:50:13Z International audience The Karjala database contains digitized demographic data of the parish registers from the regions ceded to the Soviet Union in 1944. The objectives of the digitization project have been to promote access to digitized records for scientific research and genealogy as well as encouraging research on the people of the ceded Karelia region. The main sources for the database have been catechetical lists, lists of children, and registers of vital statistics (registers of births, marriages, migrations and deaths) that are available in Digital Archives of the National Archives of Finland from the period of 1681 – 1949. The data in the database amounts to about 10.3 million entries, but only data older than 100 years is published openly on the Internet. According to decisions by the Finnish data protection authorities, the Personal Data Act is applied to personal registers less than 100 years old. The digitization process is still going on; it has been calculated that there are 1.2 million entries still to be processed. The database is available to users via https://katiha.mamk.fi/. At present, there are about 6.5 million file entries available on the Internet, each presenting data about one individual, e.g. names, the date of birth and death, the cause of death, age, gender, marital status, occupation, residence, migration, the parish. The Karjala database can be exploited for diverse research purposes; it improves access to the church records that are sometimes very difficult to read. Information in the database can be utilized for historical research, medical genetics, social sciences, and family and onomastics. The database is can be utilized for clarifying family structures, migratory patterns or child mortality. The database also offers excellent opportunities for interdisciplinary research. Our presentation will describe the digitization process management of old, handwritten documents that consist of non-structured data from a historical period that contains varied linguistic material: ... Conference Object karelia* Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)
institution Open Polar
collection Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)
op_collection_id ftccsdartic
language English
topic digitization
genealogical documents
handwriting
Karelia
Finland
[INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]
spellingShingle digitization
genealogical documents
handwriting
Karelia
Finland
[INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]
Saarti, Jarmo
Ropponen, Jari
Soivanen, Satu
The Karjala database – challenges and solutions for digitizing heterogeneous, old genealogical documents for internet use
topic_facet digitization
genealogical documents
handwriting
Karelia
Finland
[INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]
description International audience The Karjala database contains digitized demographic data of the parish registers from the regions ceded to the Soviet Union in 1944. The objectives of the digitization project have been to promote access to digitized records for scientific research and genealogy as well as encouraging research on the people of the ceded Karelia region. The main sources for the database have been catechetical lists, lists of children, and registers of vital statistics (registers of births, marriages, migrations and deaths) that are available in Digital Archives of the National Archives of Finland from the period of 1681 – 1949. The data in the database amounts to about 10.3 million entries, but only data older than 100 years is published openly on the Internet. According to decisions by the Finnish data protection authorities, the Personal Data Act is applied to personal registers less than 100 years old. The digitization process is still going on; it has been calculated that there are 1.2 million entries still to be processed. The database is available to users via https://katiha.mamk.fi/. At present, there are about 6.5 million file entries available on the Internet, each presenting data about one individual, e.g. names, the date of birth and death, the cause of death, age, gender, marital status, occupation, residence, migration, the parish. The Karjala database can be exploited for diverse research purposes; it improves access to the church records that are sometimes very difficult to read. Information in the database can be utilized for historical research, medical genetics, social sciences, and family and onomastics. The database is can be utilized for clarifying family structures, migratory patterns or child mortality. The database also offers excellent opportunities for interdisciplinary research. Our presentation will describe the digitization process management of old, handwritten documents that consist of non-structured data from a historical period that contains varied linguistic material: ...
author2 University of Eastern Finland
DARIAH
format Conference Object
author Saarti, Jarmo
Ropponen, Jari
Soivanen, Satu
author_facet Saarti, Jarmo
Ropponen, Jari
Soivanen, Satu
author_sort Saarti, Jarmo
title The Karjala database – challenges and solutions for digitizing heterogeneous, old genealogical documents for internet use
title_short The Karjala database – challenges and solutions for digitizing heterogeneous, old genealogical documents for internet use
title_full The Karjala database – challenges and solutions for digitizing heterogeneous, old genealogical documents for internet use
title_fullStr The Karjala database – challenges and solutions for digitizing heterogeneous, old genealogical documents for internet use
title_full_unstemmed The Karjala database – challenges and solutions for digitizing heterogeneous, old genealogical documents for internet use
title_sort karjala database – challenges and solutions for digitizing heterogeneous, old genealogical documents for internet use
publisher HAL CCSD
publishDate 2017
url https://inria.hal.science/hal-01660143
https://inria.hal.science/hal-01660143/document
https://inria.hal.science/hal-01660143/file/331967.pdf
op_coverage Berlin, Germany
genre karelia*
genre_facet karelia*
op_source DH. Opportunities and Risks. Connecting Libraries and Research
https://inria.hal.science/hal-01660143
DH. Opportunities and Risks. Connecting Libraries and Research, Aug 2017, Berlin, Germany
https://dh-libraries.sciencesconf.org
op_relation hal-01660143
https://inria.hal.science/hal-01660143
https://inria.hal.science/hal-01660143/document
https://inria.hal.science/hal-01660143/file/331967.pdf
op_rights http://creativecommons.org/licenses/by/
info:eu-repo/semantics/OpenAccess
_version_ 1769007136086425600