ISCA Archive ISCSLP 2008

A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin

Fang-Zhou Liu, Hui-Bin Jia, Jian-Hua Tao

Modeling prosodic rhythm is important for both speech synthesis and speech understanding, and it requires a sufficiently large corpus with precise prosodic boundary labels. This paper proposes a maximum entropy (ME) based hierarchical model, which uses both text and acoustic features, to label Mandarin prosodic boundaries automatically. Comparative experiments show that, for prosodic boundary detection, the ME model clearly outperforms the classification and regression tree (CART), and that the bottom-up hierarchical framework is significantly superior to the flat single-level framework.

Index Terms: prosodic word, prosodic phrase, intonational phrase, maximum entropy


Cite as: Liu, F.-Z., Jia, H.-B., Tao, J.-H. (2008) A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin. Proc. International Symposium on Chinese Spoken Language Processing, 257-260

@inproceedings{liu08e_iscslp,
  author={Fang-Zhou Liu and Hui-Bin Jia and Jian-Hua Tao},
  title={{A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin}},
  year=2008,
  booktitle={Proc. International Symposium on Chinese Spoken Language Processing},
  pages={257--260}
}