8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Language Identification Techniques based on Full Recognition in an Air Traffic Control Task

Ricardo de Córdoba, Javier Ferreiros, Valentin Sama, Javier Macias-Guarasa, Luis F. D'Haro, Fernando Fernandez

E.T.S.I. Telecomunicacion. Universidad Politecnica de Madrid, Spain

Automatic language identification has become an important issue in speech recognition systems in recent years. In this paper, we present our work on language identification for a continuous-speech recognizer in an air traffic control task. The system distinguishes between Spanish and English. We present several language identification techniques based on full recognition that improve on the baseline results obtained with the widely used "PPRLM" technique. Our database contains several task-specific problems that are critical for language identification, such as non-native speakers, extremely spontaneous speech, and Spanish-English mixing within the same sentence. We confirm that PPRLM is quite sensitive to these problems and that a technique based on a Bayesian classifier performs best, despite its higher computational cost.

Bibliographic reference. Córdoba, Ricardo de / Ferreiros, Javier / Sama, Valentin / Macias-Guarasa, Javier / D'Haro, Luis F. / Fernandez, Fernando (2004): "Language identification techniques based on full recognition in an air traffic control task", in INTERSPEECH-2004, 1565-1568.