The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech

International audience Currently available speech recognisers do not usually work well with elderly speech. This is because several characteristics of speech (e.g. fundamental frequency, jitter, shimmer and harmonic noise ratio) change with age and because the acoustic models used by speech recognis...

Full description

Bibliographic Details
Main Authors:	Hämäläinen, Annika, Avelar, Jairo, Rodrigues, Silvia, Sales Dias, Miguel, Kolesiński, Artur, Fegyó, Tibor, Németh, Géza, Csobanka, Petra, Lan Hing Ting, Karine, Hewson, David
Other Authors:	Microsoft Corporation Redmond, Wash., Department of Telecommunications and Media Informatics (BME-TMIT), Budapest University of Technology and Economics Budapest (BME), Bay Zoltán Nonprofit Ltd. for Applied Research, TECHnologies pour la Coopération, l’Interaction et les COnnaissances dans les collectifs (Tech-CICO), Institut Charles Delaunay (ICD), Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)-Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)-Centre National de la Recherche Scientifique (CNRS), Laboratoire Modélisation et Sûreté des Systèmes (LM2S), Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)-Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)
Format:	Conference Object
Language:	English
Published:	HAL CCSD 2014
Subjects:	automatic speech recognition corpus elderly speech [SPI]Engineering Sciences [physics] Iceland
Online Access:	https://hal-utt.archives-ouvertes.fr/hal-02896576

id	ftunivnantes:oai:HAL:hal-02896576v1
record_format	openpolar
spelling	ftunivnantes:oai:HAL:hal-02896576v1 2023-05-15T16:50:40+02:00 The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech Hämäläinen, Annika Avelar, Jairo Rodrigues, Silvia Sales Dias, Miguel Kolesiński, Artur Fegyó, Tibor Németh, Géza Csobanka, Petra Lan Hing Ting, Karine Hewson, David Microsoft Corporation Redmond, Wash. Department of Telecommunications and Media Informatics (BME-TMIT) Budapest University of Technology and Economics Budapest (BME) Bay Zoltán Nonprofit Ltd. for Applied Research TECHnologies pour la Coopération, l’Interaction et les COnnaissances dans les collectifs (Tech-CICO) Institut Charles Delaunay (ICD) Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)-Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)-Centre National de la Recherche Scientifique (CNRS) Laboratoire Modélisation et Sûreté des Systèmes (LM2S) Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)-Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS) Reykjavik, Iceland 2014-05-26 https://hal-utt.archives-ouvertes.fr/hal-02896576 en eng HAL CCSD hal-02896576 https://hal-utt.archives-ouvertes.fr/hal-02896576 LREC 2014, Ninth International Conference on Language Resources and Evaluation https://hal-utt.archives-ouvertes.fr/hal-02896576 LREC 2014, Ninth International Conference on Language Resources and Evaluation, May 2014, Reykjavik, Iceland http://www.lrec-conf.org/proceedings/lrec2014/pdf/365_Paper.pdf automatic speech recognition corpus elderly speech [SPI]Engineering Sciences [physics] info:eu-repo/semantics/conferenceObject Conference papers 2014 ftunivnantes 2023-02-08T05:35:24Z International audience Currently available speech recognisers do not usually work well with elderly speech. This is because several characteristics of speech (e.g. fundamental frequency, jitter, shimmer and harmonic noise ratio) change with age and because the acoustic models used by speech recognisers are typically trained with speech collected from younger adults only. To develop speech-driven applications capable of successfully recognising elderly speech, this type of speech data is needed for training acoustic models from scratch or for adapting acoustic models trained with younger adults' speech. However, the availability of suitable elderly speech corpora is still very limited. This paper describes an ongoing project to design, collect, transcribe and annotate large elderly speech corpora for four European languages: Portuguese, French, Hungarian and Polish. The Portuguese, French and Polish corpora contain read speech only, whereas the Hungarian corpus also contains spontaneous command and control type of speech. Depending on the language in question, the corpora contain 76 to 205 hours of speech collected from 328 to 986 speakers aged 60 and over. The final corpora will come with manually verified orthographic transcriptions, as well as annotations for filled pauses, noises and damaged words. Conference Object Iceland Université de Nantes: HAL-UNIV-NANTES
institution	Open Polar
collection	Université de Nantes: HAL-UNIV-NANTES
op_collection_id	ftunivnantes
language	English
topic	automatic speech recognition corpus elderly speech [SPI]Engineering Sciences [physics]
spellingShingle	automatic speech recognition corpus elderly speech [SPI]Engineering Sciences [physics] Hämäläinen, Annika Avelar, Jairo Rodrigues, Silvia Sales Dias, Miguel Kolesiński, Artur Fegyó, Tibor Németh, Géza Csobanka, Petra Lan Hing Ting, Karine Hewson, David The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
topic_facet	automatic speech recognition corpus elderly speech [SPI]Engineering Sciences [physics]
description	International audience Currently available speech recognisers do not usually work well with elderly speech. This is because several characteristics of speech (e.g. fundamental frequency, jitter, shimmer and harmonic noise ratio) change with age and because the acoustic models used by speech recognisers are typically trained with speech collected from younger adults only. To develop speech-driven applications capable of successfully recognising elderly speech, this type of speech data is needed for training acoustic models from scratch or for adapting acoustic models trained with younger adults' speech. However, the availability of suitable elderly speech corpora is still very limited. This paper describes an ongoing project to design, collect, transcribe and annotate large elderly speech corpora for four European languages: Portuguese, French, Hungarian and Polish. The Portuguese, French and Polish corpora contain read speech only, whereas the Hungarian corpus also contains spontaneous command and control type of speech. Depending on the language in question, the corpora contain 76 to 205 hours of speech collected from 328 to 986 speakers aged 60 and over. The final corpora will come with manually verified orthographic transcriptions, as well as annotations for filled pauses, noises and damaged words.
author2	Microsoft Corporation Redmond, Wash. Department of Telecommunications and Media Informatics (BME-TMIT) Budapest University of Technology and Economics Budapest (BME) Bay Zoltán Nonprofit Ltd. for Applied Research TECHnologies pour la Coopération, l’Interaction et les COnnaissances dans les collectifs (Tech-CICO) Institut Charles Delaunay (ICD) Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)-Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)-Centre National de la Recherche Scientifique (CNRS) Laboratoire Modélisation et Sûreté des Systèmes (LM2S) Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)-Université de Technologie de Troyes (UTT)-Centre National de la Recherche Scientifique (CNRS)
format	Conference Object
author	Hämäläinen, Annika Avelar, Jairo Rodrigues, Silvia Sales Dias, Miguel Kolesiński, Artur Fegyó, Tibor Németh, Géza Csobanka, Petra Lan Hing Ting, Karine Hewson, David
author_facet	Hämäläinen, Annika Avelar, Jairo Rodrigues, Silvia Sales Dias, Miguel Kolesiński, Artur Fegyó, Tibor Németh, Géza Csobanka, Petra Lan Hing Ting, Karine Hewson, David
author_sort	Hämäläinen, Annika
title	The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
title_short	The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
title_full	The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
title_fullStr	The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
title_full_unstemmed	The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
title_sort	easr corpora of european portuguese, french, hungarian and polish elderly speech
publisher	HAL CCSD
publishDate	2014
url	https://hal-utt.archives-ouvertes.fr/hal-02896576
op_coverage	Reykjavik, Iceland
genre	Iceland
genre_facet	Iceland
op_source	LREC 2014, Ninth International Conference on Language Resources and Evaluation https://hal-utt.archives-ouvertes.fr/hal-02896576 LREC 2014, Ninth International Conference on Language Resources and Evaluation, May 2014, Reykjavik, Iceland http://www.lrec-conf.org/proceedings/lrec2014/pdf/365_Paper.pdf
op_relation	hal-02896576 https://hal-utt.archives-ouvertes.fr/hal-02896576
_version_	1766040781358891008

The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech

Similar Items