Samrómur Children Icelandic Speech 1.0

Abstract Introduction Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology . The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (children, aged 4-17 years)...

Full description

Bibliographic Details
Main Authors: Hernández Mena, Carlos Daniel, Borsky, Michal, Mollberg, David, Guðmundsson, Smári Freyr, Hedström, Staffan, Pálsson, Ragnar, Jónsson, Ólafur Helgi, Þorsteinsdóttir, Sunneva, Guðmundsdóttir, Jóhanna Vigdís, Magnusdottir, Eydis Huld, Þórhallsdóttir, Ragnheiður, Gudnason, Jon
Language:unknown
Published: Abacus Data Network
Subjects:
Online Access:https://hdl.handle.net/11272.1/AB2/LKGTIU
id ftunibritcolumdv:hdl:11272.1/AB2/LKGTIU
record_format openpolar
spelling ftunibritcolumdv:hdl:11272.1/AB2/LKGTIU 2024-09-15T18:13:58+00:00 Samrómur Children Icelandic Speech 1.0 Hernández Mena, Carlos Daniel Borsky, Michal Mollberg, David Guðmundsson, Smári Freyr Hedström, Staffan Pálsson, Ragnar Jónsson, Ólafur Helgi Þorsteinsdóttir, Sunneva Guðmundsdóttir, Jóhanna Vigdís Magnusdottir, Eydis Huld Þórhallsdóttir, Ragnheiður Gudnason, Jon https://hdl.handle.net/11272.1/AB2/LKGTIU unknown Abacus Data Network https://hdl.handle.net/11272.1/AB2/LKGTIU Web collection Other Linguistics ftunibritcolumdv 2024-08-25T23:36:48Z Abstract Introduction Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology . The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (children, aged 4-17 years) representing 137,597 utterances. This version 1.0 is equivalent to "Samrómur Children Icelandic Speech 21.09" as used by the Language Technology Programme for Icelandic 2019-2023. Data Speech data was collected between October 2019 and September 2021 using the Samrómur website which displayed prompts to participants. The prompts were mainly from The Icelandic Gigaword Corpus , which includes text from novels, news, plays, and from a list of location names in Iceland. Additional prompts were taken from the Icelandic Web of Science and others were created by combining a name followed by a question or a demand. Prompts and speaker metadata are included in the corpus. The audio data is divided into train, dev, and test sets and is presented as flac compressed, single channel, 16 kHz, 16-bit linear PCM. Other/Unknown Material Iceland Reykjavik University Abacus Data Network
institution Open Polar
collection Abacus Data Network
op_collection_id ftunibritcolumdv
language unknown
topic Other
Linguistics
spellingShingle Other
Linguistics
Hernández Mena, Carlos Daniel
Borsky, Michal
Mollberg, David
Guðmundsson, Smári Freyr
Hedström, Staffan
Pálsson, Ragnar
Jónsson, Ólafur Helgi
Þorsteinsdóttir, Sunneva
Guðmundsdóttir, Jóhanna Vigdís
Magnusdottir, Eydis Huld
Þórhallsdóttir, Ragnheiður
Gudnason, Jon
Samrómur Children Icelandic Speech 1.0
topic_facet Other
Linguistics
description Abstract Introduction Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology . The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (children, aged 4-17 years) representing 137,597 utterances. This version 1.0 is equivalent to "Samrómur Children Icelandic Speech 21.09" as used by the Language Technology Programme for Icelandic 2019-2023. Data Speech data was collected between October 2019 and September 2021 using the Samrómur website which displayed prompts to participants. The prompts were mainly from The Icelandic Gigaword Corpus , which includes text from novels, news, plays, and from a list of location names in Iceland. Additional prompts were taken from the Icelandic Web of Science and others were created by combining a name followed by a question or a demand. Prompts and speaker metadata are included in the corpus. The audio data is divided into train, dev, and test sets and is presented as flac compressed, single channel, 16 kHz, 16-bit linear PCM.
author Hernández Mena, Carlos Daniel
Borsky, Michal
Mollberg, David
Guðmundsson, Smári Freyr
Hedström, Staffan
Pálsson, Ragnar
Jónsson, Ólafur Helgi
Þorsteinsdóttir, Sunneva
Guðmundsdóttir, Jóhanna Vigdís
Magnusdottir, Eydis Huld
Þórhallsdóttir, Ragnheiður
Gudnason, Jon
author_facet Hernández Mena, Carlos Daniel
Borsky, Michal
Mollberg, David
Guðmundsson, Smári Freyr
Hedström, Staffan
Pálsson, Ragnar
Jónsson, Ólafur Helgi
Þorsteinsdóttir, Sunneva
Guðmundsdóttir, Jóhanna Vigdís
Magnusdottir, Eydis Huld
Þórhallsdóttir, Ragnheiður
Gudnason, Jon
author_sort Hernández Mena, Carlos Daniel
title Samrómur Children Icelandic Speech 1.0
title_short Samrómur Children Icelandic Speech 1.0
title_full Samrómur Children Icelandic Speech 1.0
title_fullStr Samrómur Children Icelandic Speech 1.0
title_full_unstemmed Samrómur Children Icelandic Speech 1.0
title_sort samrómur children icelandic speech 1.0
publisher Abacus Data Network
url https://hdl.handle.net/11272.1/AB2/LKGTIU
genre Iceland
Reykjavik University
genre_facet Iceland
Reykjavik University
op_source Web collection
op_relation https://hdl.handle.net/11272.1/AB2/LKGTIU
_version_ 1810451750282330112