International Symposium on Chinese Spoken Language Processing
August 23-24, 2002
Efficient Phone Based Recognition Engines for Chinese and English Isolated Command Applications
Xavier Menendez-Pidal, Lei Duan, Jingwen Lu, Beatriz Dukes, Mike Emonts, Gustavo Hernandez-Abrego, Lex Olorenshaw
Spoken Language Technology Group, SONY NSCA, San
Jose, CA, USA
In this paper we present a flexible and efficient
approach to perform an accurate speech recognition
interface for isolated command applications in three
different languages: Mandarin, Cantonese and
English. The paper analyzes and discusses the
different trade-offs necessary to obtain an accurate,
real-time system with low memory requirements.
Areas addressed are design of the training database,
and Hidden Markov Model (HMM) units used by the
recognizer (monophones versus triphones).
Menendez-Pidal, Xavier / Duan, Lei / Lu, Jingwen / Dukes, Beatriz / Emonts, Mike / Hernandez-Abrego, Gustavo / Olorenshaw, Lex (2002):
"Efficient phone based recognition engines for Chinese and English isolated command applications",
In ISCSLP 2002, paper 32.