One-formant vocal tract modeling for glottal pulse shape estimation

International audience This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates on discrete frequencies, a notable challenge with this task would be that higher pitches in the signal can make the harmonically related fre...

Full description

Bibliographic Details
Published in:	2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Main Authors:	Chien, Yu-Ren, Roebel, Axel
Other Authors:	Analyse et synthèse sonores Paris, Sciences et Technologies de la Musique et du Son (STMS), Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)
Format:	Conference Object
Language:	English
Published:	HAL CCSD 2015
Subjects:	speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing Arctic Brisbane
Online Access:	https://hal.archives-ouvertes.fr/hal-01261248 https://doi.org/10.1109/ICASSP.2015.7178791

Description
Summary:	International audience This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates on discrete frequencies, a notable challenge with this task would be that higher pitches in the signal can make the harmonically related frequency response samples of the vocal tract filter an incomplete representation. In view of this, we propose to model the magnitude and phase response of the first formant as an alternative to the minimum phase property of the vocal tract filter. In particular, the magnitude response of the vocal tract filter sampled at the first three partials only, is sufficient for determining the phase response of the first formant. We verified our new method with glottal pulse shape parameter estimation experiments conducted on the CMU Arctic dataset, which showed that single-formant filter is an adequate alternative to minimum-phase filter in vocal tract modeling for glottal pulse shape estimation.

One-formant vocal tract modeling for glottal pulse shape estimation

Similar Items