Tone Modeling Using Stress Information for HMM-Based Thai Speech Synthesis

Decha Moungsri, Tomoki Koriyama, Takashi Nose, Takao Kobayashi


This paper describes a technique for modeling Thai tones in HMM-based speech synthesis. Tones are important prosodic features in tonal languages, including Thai, because words that are phonetically identical but carry different tones have different meanings. Although several approaches have improved the tone correctness of synthetic speech by considering tone types, another significant factor, stress, has not been used explicitly for prosody modeling. We incorporate stressed/unstressed information into the framework of HMM-based speech synthesis. Objective and subjective evaluation results show that the use of stress information improves performance in Thai tone modeling.


DOI: 10.21437/SpeechProsody.2014-200

Cite as: Moungsri, D., Koriyama, T., Nose, T., Kobayashi, T. (2014) Tone Modeling Using Stress Information for HMM-Based Thai Speech Synthesis. Proc. 7th International Conference on Speech Prosody 2014, 1057-1061, DOI: 10.21437/SpeechProsody.2014-200.


@inproceedings{Moungsri2014,
  author={Decha Moungsri and Tomoki Koriyama and Takashi Nose and Takao Kobayashi},
  title={{Tone Modeling Using Stress Information for HMM-Based Thai Speech Synthesis}},
  year=2014,
  booktitle={Proc. 7th International Conference on Speech Prosody 2014},
  pages={1057--1061},
  doi={10.21437/SpeechProsody.2014-200},
  url={http://dx.doi.org/10.21437/SpeechProsody.2014-200}
}