USTC system for Blizzard Challenge 2006 an improved HMM-based speech synthesis method

This paper introduces the USTC speech synthesis system for Blizzard Challenge 2006. The HMM-based parametric synthesis approach was adopted for its convenience and effectiveness in building a new voice, especially for the nonnative developers. Some useful techniques were also integrated into our sys...

Full description

Bibliographic Details
Main Authors: Zhen-hua Ling, Yi-jian Wu, Yu-ping Wang, Long Qin, Ren-hua Wang
Other Authors: The Pennsylvania State University CiteSeerX Archives
Format: Text
Language:English
Published: 2006
Subjects:
Online Access:http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.130.7143
http://festvox.org/blizzard/bc2006/ustc_blizzard2006.pdf
Description
Summary:This paper introduces the USTC speech synthesis system for Blizzard Challenge 2006. The HMM-based parametric synthesis approach was adopted for its convenience and effectiveness in building a new voice, especially for the nonnative developers. Some useful techniques were also integrated into our system, such as minimum generation error (MGE) training, phone duration modeling and linear spectral pair (LSP) based formant enhancement. The evaluation results show that the proposed system is able to synthesize speech with high naturalness and intelligibility by using either full database or only ARCTIC subset. 1.