ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Phonetic state tied-mixture tone modeling for large vocabulary continuous Mandarin speech recognition

Tai-Hsuan Ho, Chin-Jung Liu, Herman Sun, Ming-Yi Tsai, Lin-Shan Lee

This paper presents a new approach to tone modeling for continuous Mandarin speech recognition. Mandarin tones provide rich information for speech recognition. In this paper, we treat the tone as an attribute of the final vowel part of a Mandarin syllable. Separate distributions are estimated for cepstral coefficients and pitch features respectively, and the phonetic state tied-mixture technique is exploited to achieve improved modeling. Several tying structures are investigated, and the results are compared with that without using tonal parameters. After integrating tone models, decent improvements can be achieved in large vocabulary continuous Mandarin speech recognition. Besides, this approach can be easily incorporated into the one-pass Viterbi search framework for practical implementation of Mandarin dictation system.


doi: 10.21437/Eurospeech.1999-215

Cite as: Ho, T.-H., Liu, C.-J., Sun, H., Tsai, M.-Y., Lee, L.-S. (1999) Phonetic state tied-mixture tone modeling for large vocabulary continuous Mandarin speech recognition. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 883-886, doi: 10.21437/Eurospeech.1999-215

@inproceedings{ho99_eurospeech,
  author={Tai-Hsuan Ho and Chin-Jung Liu and Herman Sun and Ming-Yi Tsai and Lin-Shan Lee},
  title={{Phonetic state tied-mixture tone modeling for large vocabulary continuous Mandarin speech recognition}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={883--886},
  doi={10.21437/Eurospeech.1999-215}
}