Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

Acoustic Modeling and Language Modeling for Cantonese LVCSR

Y. W. Wong, K. F. Chow, Wai H. Lau, W. K. Lo, Tan Lee, P. C. Ching

Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong

This paper describes our recent work on developing a large-vocabulary, speaker-independent continuous speech recognition system for Cantonese, a major Chinese dialect. Both acoustic modeling and language modeling are addressed. For acoustic modeling, we focus on right-context-dependent sub-syllable units. HMMs are tied at both the model level and the state level, using phonetic knowledge and the decision-tree approach. A statistical language model is built from a large amount of newspaper text. The overall recognition accuracies for syllables and Chinese characters are 81.83% and 68.94%, respectively.
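To illustrate what right-context-dependent units look like, the following minimal sketch relabels a sequence of sub-syllable units so that each label also names the unit to its right. Everything in it is an assumption made for illustration and not taken from the paper: the function name `to_right_context_units`, the `unit+right` label notation, the `sil` boundary symbol, and the example split of the two syllables of "Hong Kong" into initials and finals written in Jyutping-style romanization.

```python
def to_right_context_units(units, boundary="sil"):
    """Relabel each base unit with its right neighbour as context.

    `units` is a list of context-independent sub-syllable units.
    The last unit has no right neighbour, so it takes `boundary`
    (a hypothetical silence symbol) as its context.
    """
    labels = []
    for i, unit in enumerate(units):
        # The right context is the next unit, or the boundary at the end.
        right = units[i + 1] if i + 1 < len(units) else boundary
        labels.append(f"{unit}+{right}")
    return labels


# Hypothetical initial/final split of two syllables ("hoeng", "gong").
base_units = ["h", "oeng", "g", "ong"]
print(to_right_context_units(base_units))
# ['h+oeng', 'oeng+g', 'g+ong', 'ong+sil']
```

Because each unit is modeled separately for every right neighbour, the number of distinct models grows quickly, which is one reason parameter tying at the model and state levels becomes relevant.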

