id ftccsdartic:oai:HAL:hal-01261248v1
record_format openpolar
spelling ftccsdartic:oai:HAL:hal-01261248v1 2023-05-15T15:05:45+02:00 One-formant vocal tract modeling for glottal pulse shape estimation Chien, Yu-Ren Roebel, Axel Analyse et synthèse sonores Paris Sciences et Technologies de la Musique et du Son (STMS) Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS) Brisbane, Australia 2015-04-19 https://hal.archives-ouvertes.fr/hal-01261248 https://doi.org/10.1109/ICASSP.2015.7178791 en eng HAL CCSD info:eu-repo/semantics/altIdentifier/doi/10.1109/ICASSP.2015.7178791 hal-01261248 https://hal.archives-ouvertes.fr/hal-01261248 doi:10.1109/ICASSP.2015.7178791 Proceeedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) https://hal.archives-ouvertes.fr/hal-01261248 Proceeedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. pp.4345 - 4349, ⟨10.1109/ICASSP.2015.7178791⟩ speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing info:eu-repo/semantics/conferenceObject Conference papers 2015 ftccsdartic https://doi.org/10.1109/ICASSP.2015.7178791 2021-07-17T23:03:53Z International audience This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates on discrete frequencies, a notable challenge with this task would be that higher pitches in the signal can make the harmonically related frequency response samples of the vocal tract filter an incomplete representation. In view of this, we propose to model the magnitude and phase response of the first formant as an alternative to the minimum phase property of the vocal tract filter. In particular, the magnitude response of the vocal tract filter sampled at the first three partials only, is sufficient for determining the phase response of the first formant. We verified our new method with glottal pulse shape parameter estimation experiments conducted on the CMU Arctic dataset, which showed that single-formant filter is an adequate alternative to minimum-phase filter in vocal tract modeling for glottal pulse shape estimation. Conference Object Arctic Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) Arctic Brisbane ENVELOPE(-45.633,-45.633,-60.600,-60.600) 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 4345 4349
institution Open Polar
collection Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)
op_collection_id ftccsdartic
language English
topic speech analysis
glottal pulse estimation
[INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]
[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
spellingShingle speech analysis
glottal pulse estimation
[INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]
[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
Chien, Yu-Ren
Roebel, Axel
One-formant vocal tract modeling for glottal pulse shape estimation
topic_facet speech analysis
glottal pulse estimation
[INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]
[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
description International audience This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates on discrete frequencies, a notable challenge with this task would be that higher pitches in the signal can make the harmonically related frequency response samples of the vocal tract filter an incomplete representation. In view of this, we propose to model the magnitude and phase response of the first formant as an alternative to the minimum phase property of the vocal tract filter. In particular, the magnitude response of the vocal tract filter sampled at the first three partials only, is sufficient for determining the phase response of the first formant. We verified our new method with glottal pulse shape parameter estimation experiments conducted on the CMU Arctic dataset, which showed that single-formant filter is an adequate alternative to minimum-phase filter in vocal tract modeling for glottal pulse shape estimation.
author2 Analyse et synthèse sonores Paris
Sciences et Technologies de la Musique et du Son (STMS)
Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)
format Conference Object
author Chien, Yu-Ren
Roebel, Axel
author_facet Chien, Yu-Ren
Roebel, Axel
author_sort Chien, Yu-Ren
title One-formant vocal tract modeling for glottal pulse shape estimation
title_short One-formant vocal tract modeling for glottal pulse shape estimation
title_full One-formant vocal tract modeling for glottal pulse shape estimation
title_fullStr One-formant vocal tract modeling for glottal pulse shape estimation
title_full_unstemmed One-formant vocal tract modeling for glottal pulse shape estimation
title_sort one-formant vocal tract modeling for glottal pulse shape estimation
publisher HAL CCSD
publishDate 2015
url https://hal.archives-ouvertes.fr/hal-01261248
https://doi.org/10.1109/ICASSP.2015.7178791
op_coverage Brisbane, Australia
long_lat ENVELOPE(-45.633,-45.633,-60.600,-60.600)
geographic Arctic
Brisbane
geographic_facet Arctic
Brisbane
genre Arctic
genre_facet Arctic
op_source Proceeedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
https://hal.archives-ouvertes.fr/hal-01261248
Proceeedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. pp.4345 - 4349, ⟨10.1109/ICASSP.2015.7178791⟩
op_relation info:eu-repo/semantics/altIdentifier/doi/10.1109/ICASSP.2015.7178791
hal-01261248
https://hal.archives-ouvertes.fr/hal-01261248
doi:10.1109/ICASSP.2015.7178791
op_doi https://doi.org/10.1109/ICASSP.2015.7178791
container_title 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
container_start_page 4345
op_container_end_page 4349
_version_ 1766337392671719424