ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge

Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi, Ren-Hua Wang

This paper presents a method to control the characteristics of synthetic speech flexibly by integrating articulatory features into a Hidden Markov Model (HMM)-based parametric speech synthesis system. In contrast to model adaptation and interpolation approaches for speaking style control, this method is driven by phonetic knowledge, and target speech samples are not required. The joint distribution of parallel acoustic and articulatory features considering cross-stream feature dependency is estimated. At synthesis time, acoustic and articulatory features are generated simultaneously based on the maximum-likelihood criterion. The synthetic speech can be controlled flexibly by modifying the generated articulatory features according to arbitrary phonetic rules in the parameter generation process. Our experiments show that the proposed method is effective in both changing the overall character of synthesized speech and in controlling the quality of a specific vowel.


doi: 10.21437/Interspeech.2008-169

Cite as: Ling, Z.-H., Richmond, K., Yamagishi, J., Wang, R.-H. (2008) Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge. Proc. Interspeech 2008, 573-576, doi: 10.21437/Interspeech.2008-169

@inproceedings{ling08_interspeech,
  author={Zhen-Hua Ling and Korin Richmond and Junichi Yamagishi and Ren-Hua Wang},
  title={{Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={573--576},
  doi={10.21437/Interspeech.2008-169}
}