11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

Improved Modelling of Speech Dynamics Using Non-Linear Formant Trajectories for HMM-Based Speech Synthesis

Hongwei Hu, Martin J. Russell

University of Birmingham, UK

This paper describes the use of non-linear formant trajectories to model speech dynamics. The performance of the non-linear formant dynamics model is evaluated in HMM-based speech synthesis experiments, in which the 12-dimensional parallel formant synthesiser control parameters and their time derivatives are used as the feature vectors in the HMM. Two types of formant synthesiser control parameters, piecewise constant parameters and smooth trajectory parameters, are used to drive the classic parallel formant synthesiser. The quality of the synthetic speech is assessed using three kinds of subjective tests. The results show that the non-linear formant dynamics model can improve the performance of HMM-based speech synthesis.
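
As an illustration of the feature vectors mentioned in the abstract, the following minimal sketch appends first-order time derivatives to a sequence of 12-dimensional control parameter frames. The abstract does not state how the derivatives are computed, so the central difference used here is an assumption, and the function name `add_deltas` and the toy input are hypothetical.

```python
def add_deltas(frames):
    """Append first-order time derivatives to each frame.

    frames: list of equal-length lists of floats, one list per time step.
    Returns a list of frames with twice the original dimension.
    Assumption: derivatives are approximated by a central difference;
    the paper may use a different regression window.
    """
    n = len(frames)
    out = []
    for t in range(n):
        # Repeat the edge frames at the sequence boundaries.
        prev = frames[max(t - 1, 0)]
        nxt = frames[min(t + 1, n - 1)]
        delta = [(b - a) / 2.0 for a, b in zip(prev, nxt)]
        out.append(list(frames[t]) + delta)
    return out


# Toy input: 4 frames of 12 control parameters, each rising by 1.0 per frame.
frames = [[float(t)] * 12 for t in range(4)]
features = add_deltas(frames)
```

Each output frame holds the 12 static parameters followed by their 12 derivatives, giving a 24-dimensional vector per frame under these assumptions.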

Bibliographic reference. Hu, Hongwei / Russell, Martin J. (2010): "Improved modelling of speech dynamics using non-linear formant trajectories for HMM-based speech synthesis", in INTERSPEECH-2010, 821-824.