7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

Multilingual Speech Recognition with Language Identification

Bin Ma1, Cuntai Guan (1), Haizhou Li (1), Chin-Hui Lee (2)

(1) InfoTalk Technology, Singapore; (2) National University of Singapore, Singapore

This paper presents a new approach to multilingual speech recognition. The proposed algorithm combines both language identification (LID) and speech recognition into a single process. It is shown to be effective for multilingual grammar-based speech recognition where the language information is not available prior to recognition. The idea is to make use of acoustic-phonetic and lexical information in each language to reduce possible mismatch caused by potential difference in acoustic and recording conditions when the training utterances for each language were collected. By doing so, it is shown that, with the help of LID information, the word error rate of a mixed Mandarin and English speech recognition system is greatly reduced. The same formulation can also be used to enhance language identi- fication accuracy.


Full Paper

Bibliographic reference.  Ma1, Bin / Guan, Cuntai / Li, Haizhou / Lee, Chin-Hui (2002): "Multilingual speech recognition with language identification", In ICSLP-2002, 505-508.