Traditional source-filter model has obvious limitation for speech synthesis in pitch modification due to the lack of spectrum distortion processing. To solve the problem, the paper analyzes the spectrum features of voice source in various F0 ranges and timbres in detail, and generates Muliti-Source (MS) based on analysis results by classifying the voice source into different types. The model enhances the quality of speech synthesis in various speaking mood.
Cite as: Tao, J., Kang, Y. (2004) Multi-source based acoustic model for speech synthesis. Proc. 5th ISCA Workshop on Speech Synthesis (SSW 5), 167-172
@inproceedings{tao04_ssw, author={Jianhua Tao and Yongguo Kang}, title={{Multi-source based acoustic model for speech synthesis}}, year=2004, booktitle={Proc. 5th ISCA Workshop on Speech Synthesis (SSW 5)}, pages={167--172} }