One-formant vocal tract modeling for glottal pulse shape estimation
International audience This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates on discrete frequencies, a notable challenge with this task would be that higher pitches in the signal can make the harmonically related fre...
Published in: | 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
---|---|
Main Authors: | , |
Other Authors: | , , |
Format: | Conference Object |
Language: | English |
Published: |
HAL CCSD
2015
|
Subjects: | |
Online Access: | https://hal.archives-ouvertes.fr/hal-01261248 https://doi.org/10.1109/ICASSP.2015.7178791 |
id |
ftccsdartic:oai:HAL:hal-01261248v1 |
---|---|
record_format |
openpolar |
spelling |
ftccsdartic:oai:HAL:hal-01261248v1 2023-05-15T15:05:45+02:00 One-formant vocal tract modeling for glottal pulse shape estimation Chien, Yu-Ren Roebel, Axel Analyse et synthèse sonores Paris Sciences et Technologies de la Musique et du Son (STMS) Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS) Brisbane, Australia 2015-04-19 https://hal.archives-ouvertes.fr/hal-01261248 https://doi.org/10.1109/ICASSP.2015.7178791 en eng HAL CCSD info:eu-repo/semantics/altIdentifier/doi/10.1109/ICASSP.2015.7178791 hal-01261248 https://hal.archives-ouvertes.fr/hal-01261248 doi:10.1109/ICASSP.2015.7178791 Proceeedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) https://hal.archives-ouvertes.fr/hal-01261248 Proceeedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. pp.4345 - 4349, ⟨10.1109/ICASSP.2015.7178791⟩ speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing info:eu-repo/semantics/conferenceObject Conference papers 2015 ftccsdartic https://doi.org/10.1109/ICASSP.2015.7178791 2021-07-17T23:03:53Z International audience This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates on discrete frequencies, a notable challenge with this task would be that higher pitches in the signal can make the harmonically related frequency response samples of the vocal tract filter an incomplete representation. In view of this, we propose to model the magnitude and phase response of the first formant as an alternative to the minimum phase property of the vocal tract filter. In particular, the magnitude response of the vocal tract filter sampled at the first three partials only, is sufficient for determining the phase response of the first formant. We verified our new method with glottal pulse shape parameter estimation experiments conducted on the CMU Arctic dataset, which showed that single-formant filter is an adequate alternative to minimum-phase filter in vocal tract modeling for glottal pulse shape estimation. Conference Object Arctic Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) Arctic Brisbane ENVELOPE(-45.633,-45.633,-60.600,-60.600) 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 4345 4349 |
institution |
Open Polar |
collection |
Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) |
op_collection_id |
ftccsdartic |
language |
English |
topic |
speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing |
spellingShingle |
speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing Chien, Yu-Ren Roebel, Axel One-formant vocal tract modeling for glottal pulse shape estimation |
topic_facet |
speech analysis glottal pulse estimation [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing |
description |
International audience This work considers the task of estimating the source and filter from human voice signals. Since the energy of voiced sound concentrates on discrete frequencies, a notable challenge with this task would be that higher pitches in the signal can make the harmonically related frequency response samples of the vocal tract filter an incomplete representation. In view of this, we propose to model the magnitude and phase response of the first formant as an alternative to the minimum phase property of the vocal tract filter. In particular, the magnitude response of the vocal tract filter sampled at the first three partials only, is sufficient for determining the phase response of the first formant. We verified our new method with glottal pulse shape parameter estimation experiments conducted on the CMU Arctic dataset, which showed that single-formant filter is an adequate alternative to minimum-phase filter in vocal tract modeling for glottal pulse shape estimation. |
author2 |
Analyse et synthèse sonores Paris Sciences et Technologies de la Musique et du Son (STMS) Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche et Coordination Acoustique/Musique (IRCAM)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS) |
format |
Conference Object |
author |
Chien, Yu-Ren Roebel, Axel |
author_facet |
Chien, Yu-Ren Roebel, Axel |
author_sort |
Chien, Yu-Ren |
title |
One-formant vocal tract modeling for glottal pulse shape estimation |
title_short |
One-formant vocal tract modeling for glottal pulse shape estimation |
title_full |
One-formant vocal tract modeling for glottal pulse shape estimation |
title_fullStr |
One-formant vocal tract modeling for glottal pulse shape estimation |
title_full_unstemmed |
One-formant vocal tract modeling for glottal pulse shape estimation |
title_sort |
one-formant vocal tract modeling for glottal pulse shape estimation |
publisher |
HAL CCSD |
publishDate |
2015 |
url |
https://hal.archives-ouvertes.fr/hal-01261248 https://doi.org/10.1109/ICASSP.2015.7178791 |
op_coverage |
Brisbane, Australia |
long_lat |
ENVELOPE(-45.633,-45.633,-60.600,-60.600) |
geographic |
Arctic Brisbane |
geographic_facet |
Arctic Brisbane |
genre |
Arctic |
genre_facet |
Arctic |
op_source |
Proceeedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) https://hal.archives-ouvertes.fr/hal-01261248 Proceeedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. pp.4345 - 4349, ⟨10.1109/ICASSP.2015.7178791⟩ |
op_relation |
info:eu-repo/semantics/altIdentifier/doi/10.1109/ICASSP.2015.7178791 hal-01261248 https://hal.archives-ouvertes.fr/hal-01261248 doi:10.1109/ICASSP.2015.7178791 |
op_doi |
https://doi.org/10.1109/ICASSP.2015.7178791 |
container_title |
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
container_start_page |
4345 |
op_container_end_page |
4349 |
_version_ |
1766337392671719424 |