Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

Micro-prosodic Control in Cantonese Text-to-Speech Synthesis

Tan Lee (1), Helen M. Meng (2), Wai H. Lau (1), W. K. Lo (1), P. C. Ching (1)

(1) Department of Electronic Engineering; (2) Department of Systems Engineering & Engineering Management, The Chinese University of Hong Kong, Shatin, Hong Kong

This paper describes a pioneering study of prosodic control for Cantonese text-to-speech synthesis. We attempt to establish a set of segment-level duration rules and context-dependent F0 profiles and apply them to a syllable-based concatenative speech synthesizer that uses TD-PSOLA as its prosodic modification technique. The prosodic features are extracted by statistical characterization of a large amount of speech data. A subjective listening test shows that the micro-prosodic control yields a marginal but consistent improvement in perceptual naturalness.

Bibliographic reference. Lee, Tan / Meng, Helen M. / Lau, Wai H. / Lo, W. K. / Ching, P. C. (1999): "Micro-prosodic control in Cantonese text-to-speech synthesis", In EUROSPEECH'99, 1855-1858.