International Symposium on Chinese Spoken Language Processing (ISCSLP 2002)

Taipei, Taiwan
August 23-24, 2002

Efficient Phone Based Recognition Engines for Chinese and English Isolated Command Applications

Xavier Menendez-Pidal, Lei Duan, Jingwen Lu, Beatriz Dukes, Mike Emonts, Gustavo Hernandez-Abrego, Lex Olorenshaw

Spoken Language Technology Group, SONY NSCA, San Jose, CA, USA

In this paper we present a flexible and efficient approach to perform an accurate speech recognition interface for isolated command applications in three different languages: Mandarin, Cantonese and English. The paper analyzes and discusses the different trade-offs necessary to obtain an accurate, real-time system with low memory requirements. Areas addressed are design of the training database, and Hidden Markov Model (HMM) units used by the recognizer (monophones versus triphones).


Full Paper

Bibliographic reference.  Menendez-Pidal, Xavier / Duan, Lei / Lu, Jingwen / Dukes, Beatriz / Emonts, Mike / Hernandez-Abrego, Gustavo / Olorenshaw, Lex (2002): "Efficient phone based recognition engines for Chinese and English isolated command applications", In ISCSLP 2002, paper 32.