ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

A stochastic polynomial tone model for continuous Mandarin speech

Yang Cao, Taiyi Huang, Bo Xu, Chengrong Li

In this paper, a stochastic polynomial tone model is presented for tone modeling in continuous mandarin speech. In this model, the pitch contour is described by a stochastic trajectory. The mean trajectory is represented by a polynomial function of normalized time while the variance is time varying. After that, an effective training and recognition algorithm is developed respectively. Also the problem of missing observation is discussed. Decision tree is employed to cluster the tone pattern variations, which are represented by proposed model. Many possible factors other than tone of neighboring syllables were taken into consideration when the decision tree was constructed. The experiments result shows that the tone recognition speed can increase more than 10 times while the recognition error rates decreased by 16% compared with traditional HMM tone model.


Cite as: Cao, Y., Huang, T., Xu, B., Li, C. (2000) A stochastic polynomial tone model for continuous Mandarin speech. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 674-677

@inproceedings{cao00b_icslp,
  author={Yang Cao and Taiyi Huang and Bo Xu and Chengrong Li},
  title={{A stochastic polynomial tone model for continuous Mandarin speech}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 674-677}
}