ISCA Archive ICSLP 2000

Automatic speech recognition in Mandarin for embedded platforms

Fengguang Zhao, Prabhu Raghavan, Sunil K. Gupta, Ziyi Lu, Wentao Gu, Wentao Gu

In this paper, we describe a real-time automatic speech recognition system for Mandarin that targets low-cost embedded platforms using fixed-point digital signal processors. The hands-free, speaker-independent speech recognition system employs 41 mono-phone models for representing the sounds in Mandarin Chinese and 11 whole-word models for connected digit recognition. The system achieves greater than 98% recognition accuracy on our hands-free test database of 46 distinct command phrases, and 95.9% digit accuracy on a 14-speaker, hands-free, connected digit recognition database. The analysis of the results shows that for speakers without dialect, the digit recognition accuracy is almost 98%. We present a detailed analysis of the digit recognition results and propose further improvements. A real-time platform based upon Lucent’s DSP1627 fixed-point digital signal processor has been developed.


Cite as: Zhao, F., Raghavan, P., Gupta, S.K., Lu, Z., Gu, W., Gu, W. (2000) Automatic speech recognition in Mandarin for embedded platforms. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 815-818

@inproceedings{zhao00b_icslp,
  author={Fengguang Zhao and Prabhu Raghavan and Sunil K. Gupta and Ziyi Lu and Wentao Gu and Wentao Gu},
  title={{Automatic speech recognition in Mandarin for embedded platforms}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={2},
  pages={815--818}
}