EUROSPEECH '97
5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997


A Study of Multilingual Speech Recognition

Fuliang Weng, Harry Bratt, Leonardo Neumeyer, Andreas Stolcke

Speech Technology and Research Laboratory, SRI International, Menlo Park, CA, USA

This paper describes our work in developing multilingual (Swedish and English) speech recognition systems in the ATIS domain. The acoustic component of the multilingual systems is realized through sharing Gaussian codebooks across Swedish and English allophones. The language model (LM) components are constructed by training a statistical bigram model, with a common backoff node, on bilingual texts, and by combining two monolingual LMs into a probabilistic finite state grammar. This system uses a single decoder for Swedish and English sentences, and is capable of recognizing sentences with words from both languages. Preliminary experiments show that sharing acoustic models across the two languages has not resulted in improved performance, while sharing a backoff node at the LM component provides flexibility and ease in recognizing bilingual sentences at the expense of a slight increase in word error rate in some cases. As a by-product, the bilingual decoder also achieves good performance on language identification (LID).

Full Paper

Bibliographic reference.  Weng, Fuliang / Bratt, Harry / Neumeyer, Leonardo / Stolcke, Andreas (1997): "A study of multilingual speech recognition", In EUROSPEECH-1997, 359-362.