One-formant vocal tract modeling for glottal pulse shape estimation

This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates at discrete frequencies, a notable challenge with this task is that higher pitches in the signal can make the harmonically related fre...

Bibliographic Details
Published in:	2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Main Authors:	Chien, Yu-Ren; Roebel, Axel
Other Authors:	Analyse et synthèse sonores Paris, Sciences et Technologies de la Musique et du Son (STMS), Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)
Format:	Conference Object
Language:	English
Published:	HAL CCSD 2015
Subjects:	speech analysis; glottal pulse estimation; [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD]; [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing; Arctic; Brisbane
Online Access:	https://hal.archives-ouvertes.fr/hal-01261248 https://doi.org/10.1109/ICASSP.2015.7178791

id	ftccsdartic:oai:HAL:hal-01261248v1
record_format	openpolar
spelling	ftccsdartic:oai:HAL:hal-01261248v1 2023-05-15T15:05:45+02:00 One-formant vocal tract modeling for glottal pulse shape estimation Chien, Yu-Ren Roebel, Axel Analyse et synthèse sonores Paris Sciences et Technologies de la Musique et du Son (STMS) Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS) Brisbane, Australia 2015-04-19 https://hal.archives-ouvertes.fr/hal-01261248 https://doi.org/10.1109/ICASSP.2015.7178791 en eng HAL CCSD info:eu-repo/semantics/altIdentifier/doi/10.1109/ICASSP.2015.7178791 hal-01261248 https://hal.archives-ouvertes.fr/hal-01261248 doi:10.1109/ICASSP.2015.7178791 Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) https://hal.archives-ouvertes.fr/hal-01261248 Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. pp. 4345-4349, ⟨10.1109/ICASSP.2015.7178791⟩ speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing info:eu-repo/semantics/conferenceObject Conference papers 2015 ftccsdartic https://doi.org/10.1109/ICASSP.2015.7178791 2021-07-17T23:03:53Z This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates at discrete frequencies, a notable challenge with this task is that higher pitches in the signal can make the harmonically related frequency response samples of the vocal tract filter an incomplete representation. In view of this, we propose to model the magnitude and phase response of the first formant as an alternative to the minimum-phase property of the vocal tract filter. In particular, the magnitude response of the vocal tract filter sampled at only the first three partials is sufficient for determining the phase response of the first formant. We verified our new method with glottal pulse shape parameter estimation experiments conducted on the CMU Arctic dataset, which showed that a single-formant filter is an adequate alternative to a minimum-phase filter in vocal tract modeling for glottal pulse shape estimation. Conference Object Arctic Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) Arctic Brisbane ENVELOPE(-45.633,-45.633,-60.600,-60.600) 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 4345 4349
institution	Open Polar
collection	Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)
op_collection_id	ftccsdartic
language	English
topic	speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
spellingShingle	speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing Chien, Yu-Ren Roebel, Axel One-formant vocal tract modeling for glottal pulse shape estimation
topic_facet	speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
description	This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates at discrete frequencies, a notable challenge with this task is that higher pitches in the signal can make the harmonically related frequency response samples of the vocal tract filter an incomplete representation. In view of this, we propose to model the magnitude and phase response of the first formant as an alternative to the minimum-phase property of the vocal tract filter. In particular, the magnitude response of the vocal tract filter sampled at only the first three partials is sufficient for determining the phase response of the first formant. We verified our new method with glottal pulse shape parameter estimation experiments conducted on the CMU Arctic dataset, which showed that a single-formant filter is an adequate alternative to a minimum-phase filter in vocal tract modeling for glottal pulse shape estimation.
author2	Analyse et synthèse sonores Paris Sciences et Technologies de la Musique et du Son (STMS) Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)
format	Conference Object
author	Chien, Yu-Ren Roebel, Axel
author_facet	Chien, Yu-Ren Roebel, Axel
author_sort	Chien, Yu-Ren
title	One-formant vocal tract modeling for glottal pulse shape estimation
title_short	One-formant vocal tract modeling for glottal pulse shape estimation
title_full	One-formant vocal tract modeling for glottal pulse shape estimation
title_fullStr	One-formant vocal tract modeling for glottal pulse shape estimation
title_full_unstemmed	One-formant vocal tract modeling for glottal pulse shape estimation
title_sort	one-formant vocal tract modeling for glottal pulse shape estimation
publisher	HAL CCSD
publishDate	2015
url	https://hal.archives-ouvertes.fr/hal-01261248 https://doi.org/10.1109/ICASSP.2015.7178791
op_coverage	Brisbane, Australia
long_lat	ENVELOPE(-45.633,-45.633,-60.600,-60.600)
geographic	Arctic Brisbane
geographic_facet	Arctic Brisbane
genre	Arctic
genre_facet	Arctic
op_source	Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) https://hal.archives-ouvertes.fr/hal-01261248 Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. pp. 4345-4349, ⟨10.1109/ICASSP.2015.7178791⟩
op_relation	info:eu-repo/semantics/altIdentifier/doi/10.1109/ICASSP.2015.7178791 hal-01261248 https://hal.archives-ouvertes.fr/hal-01261248 doi:10.1109/ICASSP.2015.7178791
op_doi	https://doi.org/10.1109/ICASSP.2015.7178791
container_title	2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
container_start_page	4345
op_container_end_page	4349
_version_	1766337392671719424

Similar Items