Speech and Noise Corpora for Pitch Estimation of Human Speech
This dataset contains common speech and noise corpora for evaluating fundamental frequency estimation algorithms, packaged as convenient JBOF dataframes. Each corpus is freely available on its own and allows redistribution: CMU-ARCTIC (BSD license) [1], FDA (free to download) [2], KEELE (free for noncommerc...
Main Author: | Bechtold, Bastian |
---|---|
Format: | Dataset |
Language: | English |
Published: | Zenodo 2020 |
Subjects: | speech noise |
Online Access: | https://dx.doi.org/10.5281/zenodo.3920591 https://zenodo.org/record/3920591 |
_version_ | 1821828756999766016 |
---|---|
author | Bechtold, Bastian |
author_facet | Bechtold, Bastian |
author_sort | Bechtold, Bastian |
collection | DataCite |
description | This dataset contains common speech and noise corpora for evaluating fundamental frequency estimation algorithms, packaged as convenient JBOF dataframes. Each corpus is freely available on its own and allows redistribution: CMU-ARCTIC (BSD license) [1], FDA (free to download) [2], KEELE (free for noncommercial use) [3], MOCHA-TIMIT (free for noncommercial use) [4], PTDB-TUG (ODBL license) [5], NOISEX (free to download) [7], QUT-NOISE (CC-BY-SA license) [8]. These files are published as part of my dissertation, "Pitch of Voiced Speech in the Short-Time Fourier Transform: Algorithms, Ground Truths, and Evaluation Methods", and in support of the Replication Dataset for Fundamental Frequency Estimation. References: [1] John Kominek and Alan W Black. CMU ARCTIC database for speech synthesis, 2003. [2] Paul C Bagshaw, Steven Hiller, and Mervyn A Jack. Enhanced Pitch Tracking and the Processing of F0 Contours for Computer Aided Intonation Teaching. In EUROSPEECH, 1993. [3] F Plante, Georg F Meyer, and William A Ainsworth. A Pitch Extraction Reference Database. In Fourth European Conference on Speech Communication and Technology, pages 837–840, Madrid, Spain, 1995. [4] Alan Wrench. MOCHA MultiCHannel Articulatory database: English, November 1999. [5] Gregor Pirker, Michael Wohlmayr, Stefan Petrik, and Franz Pernkopf. A Pitch Tracking Corpus with Evaluation on Multipitch Tracking Scenario. page 4, 2011. [6] John S. Garofolo, Lori F. Lamel, William M. Fisher, Jonathan G. Fiscus, David S. Pallett, Nancy L. Dahlgren, and Victor Zue. TIMIT Acoustic-Phonetic Continuous Speech Corpus, 1993. [7] Andrew Varga and Herman J.M. Steeneken. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication, 12(3):247–251, July 1993. [8] David B. Dean, Sridha Sridharan, Robert J. Vogt, and Michael W. Mason. The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms. In Proceedings of Interspeech 2010, 2010. |
format | Dataset |
genre | Arctic |
genre_facet | Arctic |
geographic | Arctic Mervyn |
geographic_facet | Arctic Mervyn |
id | ftdatacite:10.5281/zenodo.3920591 |
institution | Open Polar |
language | English |
long_lat | ENVELOPE(65.307,65.307,-70.509,-70.509) |
op_collection_id | ftdatacite |
op_doi | https://doi.org/10.5281/zenodo.3920591 https://doi.org/10.5281/zenodo.3920590 |
op_relation | https://dx.doi.org/10.5281/zenodo.3920590 |
op_rights | Open Access info:eu-repo/semantics/openAccess |
publishDate | 2020 |
publisher | Zenodo |
record_format | openpolar |
spellingShingle | speech noise Bechtold, Bastian Speech and Noise Corpora for Pitch Estimation of Human Speech |
title | Speech and Noise Corpora for Pitch Estimation of Human Speech |
title_full | Speech and Noise Corpora for Pitch Estimation of Human Speech |
title_fullStr | Speech and Noise Corpora for Pitch Estimation of Human Speech |
title_full_unstemmed | Speech and Noise Corpora for Pitch Estimation of Human Speech |
title_short | Speech and Noise Corpora for Pitch Estimation of Human Speech |
title_sort | speech and noise corpora for pitch estimation of human speech |
topic | speech noise |
topic_facet | speech noise |
url | https://dx.doi.org/10.5281/zenodo.3920591 https://zenodo.org/record/3920591 |