transform (DCT) representations on both syllable-level tone and phrase-level intonation for Chinese Mandarin speech. Decision trees growing with maximum likelihood (ML) and stopping with minimum description length (MDL) are used to cluster very rich context-dependent DCT models into generalized ones to predict unseen contexts in test robustly. Additionally, we propose to generate Mandarin tone contours by jointly optimizing F0 contours of syllable and phrase in ML sense. Experimental results on speaker-dependent continuous and speakerindependent isolated speech corpora show that the proposed approach can be able to generate F0 contour with high correlation coefficients of 0.92 and 0.82 respectively, measured between the original and generated F0. Keywords-F0 modeling, F0 generating, DCT, Mandarin tone, Intonation
Cite as: Wu, Z., Qian, Y., Soong, F.K. (2008) Modeling and Generating Tone Contour with Phrase Intonation for Chinese Mandarin Speech. Proc. International Symposium on Chinese Spoken Language Processing, 121-124
@inproceedings{wu08b_iscslp, author={Zhizheng Wu and Yao Qian and Frank K. Soong}, title={{Modeling and Generating Tone Contour with Phrase Intonation for Chinese Mandarin Speech}}, year=2008, booktitle={Proc. International Symposium on Chinese Spoken Language Processing}, pages={121--124} }