Epoch extraction from speech signals

Epoch is the instant of significant excitation of the vocal-tract system during production of speech. For most voiced speech, the most significant excitation takes place around the instant of glottal closure. Extraction of epochs from speech is a challenging task due to time-varying characteristics...

Full description

Bibliographic Details
Main Authors: Murty, K. S. R., Yegnanarayana, B.
Format: Article in Journal/Newspaper
Language:unknown
Published: IEEE 2008
Subjects:
Online Access:http://repository.ias.ac.in/57769/
http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4648930
id ftindianacasci:oai:repository.ias.ac.in:57769
record_format openpolar
spelling ftindianacasci:oai:repository.ias.ac.in:57769 2023-05-15T15:07:56+02:00 Epoch extraction from speech signals Murty, K. S. R. Yegnanarayana, B. 2008-11 http://repository.ias.ac.in/57769/ http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4648930 unknown IEEE Murty, K. S. R. Yegnanarayana, B. (2008) Epoch extraction from speech signals IEEE Transactions on Audio, Speech and Language Processing, 16 (8). pp. 1602-1613. ISSN 1558-7916 QA75 Electronic computers. Computer science Article PeerReviewed 2008 ftindianacasci 2013-01-20T12:06:21Z Epoch is the instant of significant excitation of the vocal-tract system during production of speech. For most voiced speech, the most significant excitation takes place around the instant of glottal closure. Extraction of epochs from speech is a challenging task due to time-varying characteristics of the source and the system. Most epoch extraction methods attempt to remove the characteristics of the vocal-tract system, in order to emphasize the excitation characteristics in the residual. The performance of such methods depends critically on our ability to model the system. In this paper, we propose a method for epoch extraction which does not depend critically on characteristics of the time-varying vocal-tract system. The method exploits the nature of impulse-like excitation. The proposed zero resonance frequency filter output brings out the epoch locations with high accuracy and reliability. The performance of the method is demonstrated using CMU-Arctic database using the epoch information from the electroglottograph as reference. The proposed method performs significantly better than the other methods currently available for epoch extraction. The interesting part of the results is that the epoch extraction by the proposed method seems to be robust against degradations like white noise, babble, high-frequency channel, and vehicle noise. Article in Journal/Newspaper Arctic Indian Academy of Sciences: Publication of Fellows Arctic
institution Open Polar
collection Indian Academy of Sciences: Publication of Fellows
op_collection_id ftindianacasci
language unknown
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Murty, K. S. R.
Yegnanarayana, B.
Epoch extraction from speech signals
topic_facet QA75 Electronic computers. Computer science
description Epoch is the instant of significant excitation of the vocal-tract system during production of speech. For most voiced speech, the most significant excitation takes place around the instant of glottal closure. Extraction of epochs from speech is a challenging task due to time-varying characteristics of the source and the system. Most epoch extraction methods attempt to remove the characteristics of the vocal-tract system, in order to emphasize the excitation characteristics in the residual. The performance of such methods depends critically on our ability to model the system. In this paper, we propose a method for epoch extraction which does not depend critically on characteristics of the time-varying vocal-tract system. The method exploits the nature of impulse-like excitation. The proposed zero resonance frequency filter output brings out the epoch locations with high accuracy and reliability. The performance of the method is demonstrated using CMU-Arctic database using the epoch information from the electroglottograph as reference. The proposed method performs significantly better than the other methods currently available for epoch extraction. The interesting part of the results is that the epoch extraction by the proposed method seems to be robust against degradations like white noise, babble, high-frequency channel, and vehicle noise.
format Article in Journal/Newspaper
author Murty, K. S. R.
Yegnanarayana, B.
author_facet Murty, K. S. R.
Yegnanarayana, B.
author_sort Murty, K. S. R.
title Epoch extraction from speech signals
title_short Epoch extraction from speech signals
title_full Epoch extraction from speech signals
title_fullStr Epoch extraction from speech signals
title_full_unstemmed Epoch extraction from speech signals
title_sort epoch extraction from speech signals
publisher IEEE
publishDate 2008
url http://repository.ias.ac.in/57769/
http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4648930
geographic Arctic
geographic_facet Arctic
genre Arctic
genre_facet Arctic
op_relation Murty, K. S. R.
Yegnanarayana, B. (2008) Epoch extraction from speech signals IEEE Transactions on Audio, Speech and Language Processing, 16 (8). pp. 1602-1613. ISSN 1558-7916
_version_ 1766339361482211328