ISCA Archive ICSLP 2000

Improving naturalness of Thai text-to-speech synthesis by prosodic rule

Pradit Mittrapiyanuruk, Chatchawarn Hansakunbuntheung, Virongrong Tesprasit, Virach Sornlertlamvanich

This paper presents a method for improving the naturalness of Thai text-to-speech synthesis, organized in four main parts. The pausing module determines break locations when synthesizing Thai text, which has no explicit sentence, phrase, or word boundaries. For syllable duration and tone generation, a set of rules generates proper prosodic parameters for synthesizing more natural speech. The syllable duration rule applies Klatt's method within a syllabic frame. The tonal rule accounts for tonal coarticulation and F0 downdrift when generating the F0 contour parameters. In demisyllable concatenation, the TD-PSOLA technique modifies the waveform to obtain the required prosody. LSP-based smoothing of the concatenation boundaries is also included to imitate the cross-syllable coarticulation effect. A comparative quality test shows a significant improvement with the proposed method.
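As an illustration of the kind of duration rule the abstract refers to, the following is a minimal sketch of the general Klatt formulation, DUR = MINDUR + (INHDUR - MINDUR) * PRCNT / 100, applied to a syllable rather than a phone. The function name, the millisecond values, and the rule percentages are hypothetical and are not taken from the paper; the paper's actual rule set and parameters may differ.

```python
def klatt_duration(inherent_ms, minimum_ms, rule_percents):
    """Klatt-style duration rule applied to one unit (here: a syllable).

    inherent_ms   -- inherent (unmodified) duration of the unit, in ms
    minimum_ms    -- minimum duration the unit can be compressed to, in ms
    rule_percents -- percentages contributed by each applicable rule;
                     values below 100 shorten, values above 100 lengthen
    """
    # Each applicable rule scales the running percentage multiplicatively.
    prcnt = 100.0
    for p in rule_percents:
        prcnt *= p / 100.0
    # Only the compressible part (inherent - minimum) is scaled.
    return minimum_ms + (inherent_ms - minimum_ms) * prcnt / 100.0


if __name__ == "__main__":
    # Hypothetical syllable: 300 ms inherent, 120 ms minimum,
    # one lengthening rule (140%) and one shortening rule (50%).
    print(klatt_duration(300.0, 120.0, [140.0, 50.0]))
```

With no applicable rules the function returns the inherent duration, and the result only falls to the minimum duration if the combined percentage reaches zero.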


Cite as: Mittrapiyanuruk, P., Hansakunbuntheung, C., Tesprasit, V., Sornlertlamvanich, V. (2000) Improving naturalness of Thai text-to-speech synthesis by prosodic rule. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 334-337

@inproceedings{mittrapiyanuruk00_icslp,
  author={Pradit Mittrapiyanuruk and Chatchawarn Hansakunbuntheung and Virongrong Tesprasit and Virach Sornlertlamvanich},
  title={{Improving naturalness of Thai text-to-speech synthesis by prosodic rule}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={3},
  pages={334--337}
}