Samrómur Queries Icelandic Speech 1.0 ...

Samrómur Queries Icelandic Speech 1.0 (LDC2023S05) was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 20 hours of Icelandic prompted queries from 3,809 speakers representing 17,475 utterances. This v...

Full description

Bibliographic Details
Main Authors: Hedström, Staffan, Fong, Judy, Þórhallsdóttir, Ragnheiður, Mollberg, David, Guðmundsson, Smári Freyr, Jónsson, Ólafur Helgi, Þorsteinsdóttir, Sunneva, Magnúsdóttir, Eydís Huld, Gudnason, Jon
Format: Dataset
Language:unknown
Published: Linguistic Data Consortium 2023
Subjects:
Online Access:https://dx.doi.org/10.35111/aq18-1540
https://catalog.ldc.upenn.edu/LDC2023S05
id ftdatacite:10.35111/aq18-1540
record_format openpolar
spelling ftdatacite:10.35111/aq18-1540 2023-10-01T03:56:55+02:00 Samrómur Queries Icelandic Speech 1.0 ... Hedström, Staffan Fong, Judy Þórhallsdóttir, Ragnheiður Mollberg, David Guðmundsson, Smári Freyr Jónsson, Ólafur Helgi Þorsteinsdóttir, Sunneva Magnúsdóttir, Eydís Huld Gudnason, Jon 2023 https://dx.doi.org/10.35111/aq18-1540 https://catalog.ldc.upenn.edu/LDC2023S05 unknown Linguistic Data Consortium Dataset dataset 2023 ftdatacite https://doi.org/10.35111/aq18-1540 2023-09-04T13:59:03Z Samrómur Queries Icelandic Speech 1.0 (LDC2023S05) was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 20 hours of Icelandic prompted queries from 3,809 speakers representing 17,475 utterances. This version 1.0 is equivalent to "Samrómur Queries Icelandic Speech 21.12" as used by the Language Technology Programme for Icelandic 2019-2023. Speech data was collected between October 2019 and December 2021 using the Samrómur website which displayed prompts to participants. The prompts were mainly from The Icelandic Gigaword Corpus, which includes text from novels, news, plays, and from a list of location names in Iceland. Additional prompts were taken from the Icelandic Web of Science and others were created by combining a name followed by a question. Prompts and speaker metadata are included in the corpus. The audio data is divided into train, dev, and test sets and is presented as flac compressed, single channel, ... Dataset Iceland Reykjavik University DataCite Metadata Store (German National Library of Science and Technology)
institution Open Polar
collection DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id ftdatacite
language unknown
description Samrómur Queries Icelandic Speech 1.0 (LDC2023S05) was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 20 hours of Icelandic prompted queries from 3,809 speakers representing 17,475 utterances. This version 1.0 is equivalent to "Samrómur Queries Icelandic Speech 21.12" as used by the Language Technology Programme for Icelandic 2019-2023. Speech data was collected between October 2019 and December 2021 using the Samrómur website which displayed prompts to participants. The prompts were mainly from The Icelandic Gigaword Corpus, which includes text from novels, news, plays, and from a list of location names in Iceland. Additional prompts were taken from the Icelandic Web of Science and others were created by combining a name followed by a question. Prompts and speaker metadata are included in the corpus. The audio data is divided into train, dev, and test sets and is presented as flac compressed, single channel, ...
format Dataset
author Hedström, Staffan
Fong, Judy
Þórhallsdóttir, Ragnheiður
Mollberg, David
Guðmundsson, Smári Freyr
Jónsson, Ólafur Helgi
Þorsteinsdóttir, Sunneva
Magnúsdóttir, Eydís Huld
Gudnason, Jon
spellingShingle Hedström, Staffan
Fong, Judy
Þórhallsdóttir, Ragnheiður
Mollberg, David
Guðmundsson, Smári Freyr
Jónsson, Ólafur Helgi
Þorsteinsdóttir, Sunneva
Magnúsdóttir, Eydís Huld
Gudnason, Jon
Samrómur Queries Icelandic Speech 1.0 ...
author_facet Hedström, Staffan
Fong, Judy
Þórhallsdóttir, Ragnheiður
Mollberg, David
Guðmundsson, Smári Freyr
Jónsson, Ólafur Helgi
Þorsteinsdóttir, Sunneva
Magnúsdóttir, Eydís Huld
Gudnason, Jon
author_sort Hedström, Staffan
title Samrómur Queries Icelandic Speech 1.0 ...
title_short Samrómur Queries Icelandic Speech 1.0 ...
title_full Samrómur Queries Icelandic Speech 1.0 ...
title_fullStr Samrómur Queries Icelandic Speech 1.0 ...
title_full_unstemmed Samrómur Queries Icelandic Speech 1.0 ...
title_sort samrómur queries icelandic speech 1.0 ...
publisher Linguistic Data Consortium
publishDate 2023
url https://dx.doi.org/10.35111/aq18-1540
https://catalog.ldc.upenn.edu/LDC2023S05
genre Iceland
Reykjavik University
genre_facet Iceland
Reykjavik University
op_doi https://doi.org/10.35111/aq18-1540
_version_ 1778527602875564032