An unsupervised joint prosody labeling and modeling (PLM) method for exploring the prosody of spontaneous Mandarin speech is proposed. It automatically labels a speech corpus and constructs prosodic models at the same time. Experimental results on a large dialog corpus confirm its effectiveness. Many meaningful characteristics of spontaneous-speech prosody are examined through the parameters of the well-trained prosodic models, and the prosodic feature patterns of high-level constituents of the postulated prosody hierarchy are derived. An analysis of disfluencies in relation to the labeling results is also presented. These findings can provide rich prosodic information for various speech processing applications.
Cite as: Chou, Y.-L., Chiang, C.-Y., Wang, Y.-R., Yu, H.-M., Chen, S.-H. (2010) Prosody labeling and modeling for Mandarin spontaneous speech. Proc. Speech Prosody 2010, paper 087
@inproceedings{chou10_speechprosody,
  author={Yu-Lun Chou and Chen-Yu Chiang and Yih-Ru Wang and Hsiu-Min Yu and Sin-Horng Chen},
  title={{Prosody labeling and modeling for Mandarin spontaneous speech}},
  year=2010,
  booktitle={Proc. Speech Prosody 2010},
  pages={paper 087}
}