ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Intonation modeling of Mandarin Chinese using a superpositional approach

Pablo Daniel Aguero, Antonio Bonafonte, Lu Yu, Juan Carlos Tulli

The intonation model is an important component in text-to-speech systems to obtain natural and expressive speech synthesis. In this paper we propose a superpositional model for Mandarin Chinese. The intonation model is composed of the syllable and the phrase component. The parameters of the model are estimated using JEMA, a training approach with many advantages related to robustness and precision. Parameter estimation and model training are combined into a loop to progressively refine both the parameterization and the model. The high correlation (0.82) between synthetic and original contours in the test data show the suitability of this approach for modeling Mandarin. Furthermore, the high scores got in subjective evaluation (MOS=4.06) confirm the objective results.


doi: 10.21437/Interspeech.2008-553

Cite as: Aguero, P.D., Bonafonte, A., Yu, L., Tulli, J.C. (2008) Intonation modeling of Mandarin Chinese using a superpositional approach. Proc. Interspeech 2008, 2134-2137, doi: 10.21437/Interspeech.2008-553

@inproceedings{aguero08_interspeech,
  author={Pablo Daniel Aguero and Antonio Bonafonte and Lu Yu and Juan Carlos Tulli},
  title={{Intonation modeling of Mandarin Chinese using a superpositional approach}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={2134--2137},
  doi={10.21437/Interspeech.2008-553}
}