10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Voiced/Unvoiced Decision Algorithm for HMM-Based Speech Synthesis

Shiyin Kang (1), Zhiwei Shuang (2), Quansheng Duan (1), Yong Qin (2), Lianhong Cai (1)

(1) Tsinghua University, China
(2) IBM China Research Lab, China

This paper introduces a novel method for improving the voiced/unvoiced (U/V) decision in HMM-based speech synthesis. In the conventional method, the U/V decision for each state is made independently, so a state in the middle of a vowel may be classified as unvoiced. We propose to exploit the constraints of natural speech to improve the U/V decisions within a unit, such as a syllable or phone. We use a GMM-based U/V change time model to select the best U/V change time in a unit, and then refine the U/V decisions of all states in that unit based on the selected change time. A perceptual evaluation demonstrates that the proposed method significantly improves the naturalness of the synthetic speech.
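
The abstract does not give the scoring details, so the following is only an illustrative sketch of the general idea and not the authors' implementation. It assumes that a unit contains a single unvoiced-to-voiced change located at a state boundary, that each state comes with a voicing probability and a duration in frames, and that the change time model is a one-dimensional GMM over the change time normalized by the unit duration. The function names, the scoring rule (duration-weighted state log-likelihood plus GMM log-density), and the example numbers are all hypothetical.

```python
import math


def gmm_logpdf(x, weights, means, variances):
    """Log density of a 1-D Gaussian mixture at x (log-sum-exp for stability)."""
    terms = [
        math.log(w) - 0.5 * math.log(2 * math.pi * v) - (x - m) ** 2 / (2 * v)
        for w, m, v in zip(weights, means, variances)
    ]
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))


def refine_uv(voiced_prob, durations, gmm, eps=1e-6):
    """Pick one U->V change point for a unit and relabel all of its states.

    voiced_prob: per-state voicing probabilities (assumed input)
    durations:   per-state durations in frames
    gmm:         (weights, means, variances) over the normalized change time
    Returns (labels, change_index); labels[i] is True when state i is voiced.
    """
    n = len(durations)
    total = sum(durations)
    best_score, best_k = -math.inf, 0
    elapsed = 0  # frames before candidate boundary k
    for k in range(n + 1):
        # Candidate k: states [0, k) are unvoiced, states [k, n) are voiced.
        score = gmm_logpdf(elapsed / total, *gmm)
        for i, (p, d) in enumerate(zip(voiced_prob, durations)):
            p = min(max(p, eps), 1 - eps)  # avoid log(0)
            score += d * math.log(p if i >= k else 1 - p)
        if score > best_score:
            best_score, best_k = score, k
        if k < n:
            elapsed += durations[k]
    return [i >= best_k for i in range(n)], best_k


# Hypothetical unit: state 3 (probability 0.4) would be unvoiced under an
# independent 0.5 threshold even though its neighbours are clearly voiced.
voiced_prob = [0.1, 0.2, 0.9, 0.4, 0.95]
durations = [3, 4, 5, 4, 6]
gmm = ([1.0], [0.3], [0.02])  # single component, made-up parameters

independent = [p > 0.5 for p in voiced_prob]
labels, change_index = refine_uv(voiced_prob, durations, gmm)
print(independent)  # [False, False, True, False, True]
print(labels)       # [False, False, True, True, True]
```

In this toy case the independent per-state decision leaves an unvoiced state between two voiced ones, while the unit-level decision places one change after the second state and labels every later state as voiced.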


Bibliographic reference.  Kang, Shiyin / Shuang, Zhiwei / Duan, Quansheng / Qin, Yong / Cai, Lianhong (2009): "Voiced/unvoiced decision algorithm for HMM-based speech synthesis", In INTERSPEECH-2009, 412-415.