Voiced/nonvoiced detection based on robustness of voiced epochs

In this paper, a new method for voiced/nonvoiced detection based on epoch extraction is proposed. Zero-frequency filtered speech signal is used to extract the instants of significant excitation (or epochs). The robustness of the method to extract epochs in the voiced regions, even with small amount...

Full description

Bibliographic Details
Main Authors: Dhananjaya, N., Yegnanarayana, B.
Format: Article in Journal/Newspaper
Language:unknown
Published: IEEE 2010
Subjects:
Online Access:http://repository.ias.ac.in/57787/
http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=5353617
Description
Summary:In this paper, a new method for voiced/nonvoiced detection based on epoch extraction is proposed. Zero-frequency filtered speech signal is used to extract the instants of significant excitation (or epochs). The robustness of the method to extract epochs in the voiced regions, even with small amount of additive white noise, is used to distinguish voiced epochs from random instants detected in nonvoiced regions. The main feature of the proposed method is that it uses the strength of glottal activity as against using the periodicity of the signal. Performance of the proposed algorithm is studied on TIMIT and CMU ARCTIC databases, for two different noise types, white and vehicle noise from the NOISEX database, at different signal-to-noise ratios (SNRs). The proposed method performs similar or better than the popular normalized crosscorrelation based voiced/nonvoiced detection used in the open source utility wavesurfer, especially at lower SNRs.