Samrómur Children Icelandic Speech 1.0
Abstract. Introduction: Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University, in cooperation with Almannarómur, Center for Language Technology. The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (children, aged 4-17 years)...
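Main Authors: | Hernández Mena, Carlos Daniel; Borsky, Michal; Mollberg, David; Guðmundsson, Smári Freyr; Hedström, Staffan; Pálsson, Ragnar; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Magnusdottir, Eydis Huld; Þórhallsdóttir, Ragnheiður; Gudnason, Jon |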
---|---|
Language: | unknown |
Published: | Abacus Data Network |
Subjects: | Other Linguistics |
Online Access: | https://hdl.handle.net/11272.1/AB2/LKGTIU |
id |
ftunibritcolumdv:hdl:11272.1/AB2/LKGTIU |
---|---|
record_format |
openpolar |
spelling |
ftunibritcolumdv:hdl:11272.1/AB2/LKGTIU 2024-09-15T18:13:58+00:00 Samrómur Children Icelandic Speech 1.0 Hernández Mena, Carlos Daniel; Borsky, Michal; Mollberg, David; Guðmundsson, Smári Freyr; Hedström, Staffan; Pálsson, Ragnar; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Magnusdottir, Eydis Huld; Þórhallsdóttir, Ragnheiður; Gudnason, Jon https://hdl.handle.net/11272.1/AB2/LKGTIU unknown Abacus Data Network https://hdl.handle.net/11272.1/AB2/LKGTIU Web collection Other Linguistics ftunibritcolumdv 2024-08-25T23:36:48Z Abstract. Introduction: Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University, in cooperation with Almannarómur, Center for Language Technology. The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (children, aged 4-17 years), comprising 137,597 utterances. This version 1.0 is equivalent to "Samrómur Children Icelandic Speech 21.09" as used by the Language Technology Programme for Icelandic 2019-2023. Data: Speech data was collected between October 2019 and September 2021 using the Samrómur website, which displayed prompts to participants. The prompts were drawn mainly from The Icelandic Gigaword Corpus, which includes text from novels, news, and plays, and from a list of location names in Iceland. Additional prompts were taken from the Icelandic Web of Science, and others were created by combining a name with a following question or demand. Prompts and speaker metadata are included in the corpus. The audio data is divided into train, dev, and test sets and is presented as FLAC-compressed, single-channel, 16 kHz, 16-bit linear PCM. Other/Unknown Material Iceland Reykjavik University Abacus Data Network |
institution |
Open Polar |
collection |
Abacus Data Network |
op_collection_id |
ftunibritcolumdv |
language |
unknown |
topic |
Other Linguistics |
spellingShingle |
Other Linguistics Hernández Mena, Carlos Daniel; Borsky, Michal; Mollberg, David; Guðmundsson, Smári Freyr; Hedström, Staffan; Pálsson, Ragnar; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Magnusdottir, Eydis Huld; Þórhallsdóttir, Ragnheiður; Gudnason, Jon Samrómur Children Icelandic Speech 1.0 |
topic_facet |
Other Linguistics |
description |
Abstract. Introduction: Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University, in cooperation with Almannarómur, Center for Language Technology. The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (children, aged 4-17 years), comprising 137,597 utterances. This version 1.0 is equivalent to "Samrómur Children Icelandic Speech 21.09" as used by the Language Technology Programme for Icelandic 2019-2023. Data: Speech data was collected between October 2019 and September 2021 using the Samrómur website, which displayed prompts to participants. The prompts were drawn mainly from The Icelandic Gigaword Corpus, which includes text from novels, news, and plays, and from a list of location names in Iceland. Additional prompts were taken from the Icelandic Web of Science, and others were created by combining a name with a following question or demand. Prompts and speaker metadata are included in the corpus. The audio data is divided into train, dev, and test sets and is presented as FLAC-compressed, single-channel, 16 kHz, 16-bit linear PCM. |
author |
Hernández Mena, Carlos Daniel; Borsky, Michal; Mollberg, David; Guðmundsson, Smári Freyr; Hedström, Staffan; Pálsson, Ragnar; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Magnusdottir, Eydis Huld; Þórhallsdóttir, Ragnheiður; Gudnason, Jon |
author_facet |
Hernández Mena, Carlos Daniel; Borsky, Michal; Mollberg, David; Guðmundsson, Smári Freyr; Hedström, Staffan; Pálsson, Ragnar; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Magnusdottir, Eydis Huld; Þórhallsdóttir, Ragnheiður; Gudnason, Jon |
author_sort |
Hernández Mena, Carlos Daniel |
title |
Samrómur Children Icelandic Speech 1.0 |
title_short |
Samrómur Children Icelandic Speech 1.0 |
title_full |
Samrómur Children Icelandic Speech 1.0 |
title_fullStr |
Samrómur Children Icelandic Speech 1.0 |
title_full_unstemmed |
Samrómur Children Icelandic Speech 1.0 |
title_sort |
samrómur children icelandic speech 1.0 |
publisher |
Abacus Data Network |
url |
https://hdl.handle.net/11272.1/AB2/LKGTIU |
genre |
Iceland Reykjavik University |
genre_facet |
Iceland Reykjavik University |
op_source |
Web collection |
op_relation |
https://hdl.handle.net/11272.1/AB2/LKGTIU |
_version_ |
1810451750282330112 |