The intonation model is an important component in text-to-speech systems to obtain natural and expressive speech synthesis. In this paper we propose a superpositional model for Mandarin Chinese. The intonation model is composed of the syllable and the phrase component. The parameters of the model are estimated using JEMA, a training approach with many advantages related to robustness and precision. Parameter estimation and model training are combined into a loop to progressively refine both the parameterization and the model. The high correlation (0.82) between synthetic and original contours in the test data show the suitability of this approach for modeling Mandarin. Furthermore, the high scores got in subjective evaluation (MOS=4.06) confirm the objective results.
Bibliographic reference. Aguero, Pablo Daniel / Bonafonte, Antonio / Yu, Lu / Tulli, Juan Carlos (2008): "Intonation modeling of Mandarin Chinese using a superpositional approach", In INTERSPEECH-2008, 2134-2137.