Prosody is an important factor in a high-quality text-to-speech (TTS) system and is often described with a hierarchical structure. Generating this hierarchical prosodic structure is therefore important both in corpus building and in real-time text analysis, but manual prosody labeling is laborious and time-consuming. This paper presents an automatic prosody boundary labeling system based on the classification and regression tree (CART) framework. The prosody model is built from acoustic and text information, using a large speech corpus annotated with prosodic structure labels (ASCCD). Experiments show that the model achieves 90.86% accuracy in prosody boundary detection.
Index Terms: prosody boundary, CART, Chinese information processing, prosody prediction, acoustic-prosodic feature
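To illustrate the general idea of a CART classifier deciding boundary versus non-boundary from combined acoustic and text cues, here is a minimal sketch in plain Python. It is not the authors' system: the three features (pause duration in ms, pitch reset in semitones, a punctuation flag), the toy data, and all function names are hypothetical, and the tree is a bare Gini-based binary tree without pruning.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Return (feature_index, threshold) minimising weighted Gini, or None."""
    best, best_score = None, gini(labels)
    n = len(rows)
    for f in range(len(rows[0])):
        for t in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best_score:  # keep only strict improvements
                best, best_score = (f, t), score
    return best

def build_tree(rows, labels, depth=0, max_depth=3):
    """Grow a binary tree; leaves are class labels, nodes are 4-tuples."""
    split = best_split(rows, labels)
    if split is None or depth >= max_depth:
        return Counter(labels).most_common(1)[0][0]  # leaf: majority class
    f, t = split
    left = [(r, y) for r, y in zip(rows, labels) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, labels) if r[f] > t]
    return (
        f, t,
        build_tree([r for r, _ in left], [y for _, y in left], depth + 1, max_depth),
        build_tree([r for r, _ in right], [y for _, y in right], depth + 1, max_depth),
    )

def predict(tree, row):
    """Walk from the root to a leaf and return its class label."""
    while isinstance(tree, tuple):
        f, t, low, high = tree
        tree = low if row[f] <= t else high
    return tree

# Hypothetical toy data, one row per syllable juncture:
# [pause_ms (acoustic), pitch_reset_semitones (acoustic), punctuation_follows (text)]
X = [
    [0, 0.2, 0], [10, 0.5, 0], [20, 0.3, 0], [15, 0.4, 1],
    [150, 2.5, 1], [200, 3.0, 1], [120, 1.8, 0], [180, 2.2, 0],
]
y = [0, 0, 0, 0, 1, 1, 1, 1]  # 1 = prosodic boundary, 0 = no boundary

tree = build_tree(X, y)
accuracy = sum(predict(tree, r) == label for r, label in zip(X, y)) / len(y)
```

On this toy set the tree needs only a single split on pause duration; real data would call for many more features, held-out evaluation, and pruning. A library implementation such as scikit-learn's `DecisionTreeClassifier` could replace the hand-written tree.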
Cite as: Ni, C.-J., Liu, W.-J., Xu, B. (2008) Automatic Prosody Boundary Labeling of Mandarin Using Both Text and Acoustic Information. Proc. International Symposium on Chinese Spoken Language Processing, 354-357
@inproceedings{ni08b_iscslp, author={Chong-Jia Ni and Wen-Ju Liu and Bo Xu}, title={{Automatic Prosody Boundary Labeling of Mandarin Using Both Text and Acoustic Information}}, year=2008, booktitle={Proc. International Symposium on Chinese Spoken Language Processing}, pages={354--357} }