Speech and Noise Corpora for Pitch Estimation of Human Speech

This dataset contains common speech and noise corpora for evaluating fundamental frequency estimation algorithms as convenient JBOF dataframes. Each corpus is available freely on its own, and allows redistribution: CMU-ARCTIC ( BSD license) [1] FDA ( free to download) [2] KEELE ( free for noncommerc...

Full description

Bibliographic Details
Main Author: Bastian Bechtold
Format: Other/Unknown Material
Language:English
Published: Zenodo 2020
Subjects:
Online Access:https://doi.org/10.5281/zenodo.3920591
_version_ 1821828856871387136
author Bastian Bechtold
author_facet Bastian Bechtold
author_sort Bastian Bechtold
collection Zenodo
description This dataset contains common speech and noise corpora for evaluating fundamental frequency estimation algorithms as convenient JBOF dataframes. Each corpus is available freely on its own, and allows redistribution: CMU-ARCTIC ( BSD license) [1] FDA ( free to download) [2] KEELE ( free for noncommercial use ) [3] MOCHA-TIMIT ( free for noncommercial use ) [4] PTDB-TUG ( ODBL license ) [5] NOISEX ( free to download ) [7] QUT-NOISE ( CC-BY-SA license ) [8] These files are published as part of my dissertation, " Pitch of Voiced Speech in the Short-Time Fourier Transform: Algorithms, Ground Truths, and Evaluation Methods ", and in support of the Replication Dataset for Fundamental Frequency Estimation . References: John Kominek and Alan W Black. CMU ARCTIC database for speech synthesis, 2003. Paul C Bagshaw, Steven Hiller, and Mervyn A Jack. Enhanced Pitch Tracking and the Processing of F0 Contours for Computer Aided Intonation Teaching. In EUROSPEECH, 1993. F Plante, Georg F Meyer, and William A Ainsworth. A Pitch Extraction Reference Database. In Fourth European Conference on Speech Communication and Technology, pages 837–840, Madrid, Spain, 1995. Alan Wrench. MOCHA MultiCHannel Articulatory database: English, November 1999. Gregor Pirker, Michael Wohlmayr, Stefan Petrik, and Franz Pernkopf. A Pitch Tracking Corpus with Evaluation on Multipitch Tracking Scenario. page 4, 2011. John S. Garofolo, Lori F. Lamel, William M. Fisher, Jonathan G. Fiscus, David S. Pallett, Nancy L. Dahlgren, and Victor Zue. TIMIT Acoustic-Phonetic Continuous Speech Corpus, 1993. Andrew Varga and Herman J.M. Steeneken. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recog- nition systems. Speech Communication, 12(3):247–251, July 1993. David B. Dean, Sridha Sridharan, Robert J. Vogt, and Michael W. Mason. The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms. Proceedings of Interspeech 2010, 2010.
format Other/Unknown Material
genre Arctic
genre_facet Arctic
geographic Arctic
Mervyn
geographic_facet Arctic
Mervyn
id ftzenodo:oai:zenodo.org:3920591
institution Open Polar
language English
long_lat ENVELOPE(65.307,65.307,-70.509,-70.509)
op_collection_id ftzenodo
op_doi https://doi.org/10.5281/zenodo.392059110.5281/zenodo.3920590
op_relation https://doi.org/10.5281/zenodo.3920590
https://doi.org/10.5281/zenodo.3920591
oai:zenodo.org:3920591
op_rights info:eu-repo/semantics/openAccess
Other (Non-Commercial)
publishDate 2020
publisher Zenodo
record_format openpolar
spelling ftzenodo:oai:zenodo.org:3920591 2025-01-16T20:33:47+00:00 Speech and Noise Corpora for Pitch Estimation of Human Speech Bastian Bechtold 2020-06-29 https://doi.org/10.5281/zenodo.3920591 eng eng Zenodo https://doi.org/10.5281/zenodo.3920590 https://doi.org/10.5281/zenodo.3920591 oai:zenodo.org:3920591 info:eu-repo/semantics/openAccess Other (Non-Commercial) speech noise info:eu-repo/semantics/other 2020 ftzenodo https://doi.org/10.5281/zenodo.392059110.5281/zenodo.3920590 2024-12-05T07:38:00Z This dataset contains common speech and noise corpora for evaluating fundamental frequency estimation algorithms as convenient JBOF dataframes. Each corpus is available freely on its own, and allows redistribution: CMU-ARCTIC ( BSD license) [1] FDA ( free to download) [2] KEELE ( free for noncommercial use ) [3] MOCHA-TIMIT ( free for noncommercial use ) [4] PTDB-TUG ( ODBL license ) [5] NOISEX ( free to download ) [7] QUT-NOISE ( CC-BY-SA license ) [8] These files are published as part of my dissertation, " Pitch of Voiced Speech in the Short-Time Fourier Transform: Algorithms, Ground Truths, and Evaluation Methods ", and in support of the Replication Dataset for Fundamental Frequency Estimation . References: John Kominek and Alan W Black. CMU ARCTIC database for speech synthesis, 2003. Paul C Bagshaw, Steven Hiller, and Mervyn A Jack. Enhanced Pitch Tracking and the Processing of F0 Contours for Computer Aided Intonation Teaching. In EUROSPEECH, 1993. F Plante, Georg F Meyer, and William A Ainsworth. A Pitch Extraction Reference Database. In Fourth European Conference on Speech Communication and Technology, pages 837–840, Madrid, Spain, 1995. Alan Wrench. MOCHA MultiCHannel Articulatory database: English, November 1999. Gregor Pirker, Michael Wohlmayr, Stefan Petrik, and Franz Pernkopf. A Pitch Tracking Corpus with Evaluation on Multipitch Tracking Scenario. page 4, 2011. John S. Garofolo, Lori F. Lamel, William M. Fisher, Jonathan G. Fiscus, David S. Pallett, Nancy L. Dahlgren, and Victor Zue. TIMIT Acoustic-Phonetic Continuous Speech Corpus, 1993. Andrew Varga and Herman J.M. Steeneken. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recog- nition systems. Speech Communication, 12(3):247–251, July 1993. David B. Dean, Sridha Sridharan, Robert J. Vogt, and Michael W. Mason. The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms. Proceedings of Interspeech 2010, 2010. Other/Unknown Material Arctic Zenodo Arctic Mervyn ENVELOPE(65.307,65.307,-70.509,-70.509)
spellingShingle speech
noise
Bastian Bechtold
Speech and Noise Corpora for Pitch Estimation of Human Speech
title Speech and Noise Corpora for Pitch Estimation of Human Speech
title_full Speech and Noise Corpora for Pitch Estimation of Human Speech
title_fullStr Speech and Noise Corpora for Pitch Estimation of Human Speech
title_full_unstemmed Speech and Noise Corpora for Pitch Estimation of Human Speech
title_short Speech and Noise Corpora for Pitch Estimation of Human Speech
title_sort speech and noise corpora for pitch estimation of human speech
topic speech
noise
topic_facet speech
noise
url https://doi.org/10.5281/zenodo.3920591