Epoch Extraction From Speech Signals
Abstract—Epoch is the instant of significant excitation of the vocal-tract system during production of speech. For most voiced speech, the most significant excitation takes place around the in-stant of glottal closure. Extraction of epochs from speech is a chal-lenging task due to time-varying chara...
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Text |
Language: | English |
Subjects: | |
Online Access: | http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.586.7214 http://speech.iiit.ac.in/svlpubs/article/MurtyK.S.R.Yegna2008.pdf |
id |
ftciteseerx:oai:CiteSeerX.psu:10.1.1.586.7214 |
---|---|
record_format |
openpolar |
spelling |
ftciteseerx:oai:CiteSeerX.psu:10.1.1.586.7214 2023-05-15T15:08:54+02:00 Epoch Extraction From Speech Signals K. Sri Rama Murty B. Yegnanarayana Senior Member The Pennsylvania State University CiteSeerX Archives application/pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.586.7214 http://speech.iiit.ac.in/svlpubs/article/MurtyK.S.R.Yegna2008.pdf en eng http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.586.7214 http://speech.iiit.ac.in/svlpubs/article/MurtyK.S.R.Yegna2008.pdf Metadata may be used without restrictions as long as the oai identifier remains attached to it. http://speech.iiit.ac.in/svlpubs/article/MurtyK.S.R.Yegna2008.pdf text ftciteseerx 2016-01-08T13:17:57Z Abstract—Epoch is the instant of significant excitation of the vocal-tract system during production of speech. For most voiced speech, the most significant excitation takes place around the in-stant of glottal closure. Extraction of epochs from speech is a chal-lenging task due to time-varying characteristics of the source and the system. Most epoch extraction methods attempt to remove the characteristics of the vocal-tract system, in order to emphasize the excitation characteristics in the residual. The performance of such methods depends critically on our ability to model the system. In this paper, we propose a method for epoch extraction which does not depend critically on characteristics of the time-varying vocal-tract system. The method exploits the nature of impulse-like exci-tation. The proposed zero resonance frequency filter output brings out the epoch locations with high accuracy and reliability. The per-formance of the method is demonstrated using CMU-Arctic data-base using the epoch information from the electro-glottograph as reference. The proposed method performs significantly better than the other methods currently available for epoch extraction. The in-teresting part of the results is that the epoch extraction by the pro-posed method seems to be robust against degradations like white noise, babble, high-frequency channel, and vehicle noise. Index Terms—Epoch extraction, glottal closure instant, group-delay, Hilbert envelope, instantaneous frequency. I. Text Arctic Unknown Arctic |
institution |
Open Polar |
collection |
Unknown |
op_collection_id |
ftciteseerx |
language |
English |
description |
Abstract—Epoch is the instant of significant excitation of the vocal-tract system during production of speech. For most voiced speech, the most significant excitation takes place around the in-stant of glottal closure. Extraction of epochs from speech is a chal-lenging task due to time-varying characteristics of the source and the system. Most epoch extraction methods attempt to remove the characteristics of the vocal-tract system, in order to emphasize the excitation characteristics in the residual. The performance of such methods depends critically on our ability to model the system. In this paper, we propose a method for epoch extraction which does not depend critically on characteristics of the time-varying vocal-tract system. The method exploits the nature of impulse-like exci-tation. The proposed zero resonance frequency filter output brings out the epoch locations with high accuracy and reliability. The per-formance of the method is demonstrated using CMU-Arctic data-base using the epoch information from the electro-glottograph as reference. The proposed method performs significantly better than the other methods currently available for epoch extraction. The in-teresting part of the results is that the epoch extraction by the pro-posed method seems to be robust against degradations like white noise, babble, high-frequency channel, and vehicle noise. Index Terms—Epoch extraction, glottal closure instant, group-delay, Hilbert envelope, instantaneous frequency. I. |
author2 |
The Pennsylvania State University CiteSeerX Archives |
format |
Text |
author |
K. Sri Rama Murty B. Yegnanarayana Senior Member |
spellingShingle |
K. Sri Rama Murty B. Yegnanarayana Senior Member Epoch Extraction From Speech Signals |
author_facet |
K. Sri Rama Murty B. Yegnanarayana Senior Member |
author_sort |
K. Sri |
title |
Epoch Extraction From Speech Signals |
title_short |
Epoch Extraction From Speech Signals |
title_full |
Epoch Extraction From Speech Signals |
title_fullStr |
Epoch Extraction From Speech Signals |
title_full_unstemmed |
Epoch Extraction From Speech Signals |
title_sort |
epoch extraction from speech signals |
url |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.586.7214 http://speech.iiit.ac.in/svlpubs/article/MurtyK.S.R.Yegna2008.pdf |
geographic |
Arctic |
geographic_facet |
Arctic |
genre |
Arctic |
genre_facet |
Arctic |
op_source |
http://speech.iiit.ac.in/svlpubs/article/MurtyK.S.R.Yegna2008.pdf |
op_relation |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.586.7214 http://speech.iiit.ac.in/svlpubs/article/MurtyK.S.R.Yegna2008.pdf |
op_rights |
Metadata may be used without restrictions as long as the oai identifier remains attached to it. |
_version_ |
1766340178445598720 |