ISCA Archive ISCSLP 2008
ISCA Archive ISCSLP 2008

Prosody Study with Context-dependent Acoustic Models

Yue-Ning Hu, Min Chu

In this paper, we propose to study prosody with context-dependent acoustic models. We find that we can achieve better resolution on a specific aspect by training CDM with certain focus. For the tone recognition task, CDM with focus on tones should be used and it achieves 15.2% relative error reduction, when comparing with the traditional tri-phone models. For detecting prosody boundaries, CDM with focus on position should be used and the accuracy of prosodic word is 92.2%. CDMs are also used to visualize the f0 patterns of sentences with give contextual information. Such patterns are helpful to understand the interaction among contextual factors. Overall, CDMs are useful data source for various prosody studies. Index Terms—context-dependent model, model focus, prosody study, tone recognition, phrase boundary detection


Cite as: Hu, Y.-N., Chu, M. (2008) Prosody Study with Context-dependent Acoustic Models. Proc. International Symposium on Chinese Spoken Language Processing, 57-60

@inproceedings{hu08b_iscslp,
  author={Yue-Ning Hu and Min Chu},
  title={{Prosody Study with Context-dependent Acoustic Models}},
  year=2008,
  booktitle={Proc. International Symposium on Chinese Spoken Language Processing},
  pages={57--60}
}