8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003


PPRLM Optimization for Language Identification in Air Traffic Control Tasks

R. Cordoba, G. Prime, J. Macias-Guarasa, J.M. Montero, J. Ferreiros, J.M. Pardo

Universidad Politecnica de Madrid, Spain

In this paper, we present our work on language identification for two air traffic control speech recognizers, one for continuous speech and the other for a command interface. The system distinguishes between Spanish and English. We confirm the advantage of using PPRLM over PRLM. All previous studies show that PPRLM is the technique with the best performance despite its drawbacks: it needs more processing time and more labeled data. No work has been published regarding the optimum weights that should be given to the language models to optimize the performance of the language recognizer. This paper addresses this topic, providing three different approaches for weight selection in the language model score. We also show that a trigram language model improves performance. The final results are very good even with very short segments of speech.
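
To illustrate what weighting the language model scores can mean in a PPRLM setup, here is a minimal Python sketch. It is not the paper's method: the abstract does not describe the three weight-selection approaches, so the linear combination below is only one plausible form. The function name `pprlm_decide`, the recognizer labels `es_phones` and `en_phones`, and all score and weight values are hypothetical, chosen only to show that the decision can change with the weights.

```python
from typing import Dict

# Hypothetical language-model log-likelihoods, for illustration only.
# scores[recognizer][language] is the score that the language model for
# `language` assigns to the phone sequence produced by `recognizer`.
example_scores: Dict[str, Dict[str, float]] = {
    "es_phones": {"spanish": -120.0, "english": -131.0},
    "en_phones": {"spanish": -140.0, "english": -128.0},
}


def pprlm_decide(scores: Dict[str, Dict[str, float]],
                 weights: Dict[str, float]) -> str:
    """Return the language with the highest weighted sum of LM scores.

    Assumption: the per-recognizer scores are combined linearly, with one
    weight per phone recognizer. This is a sketch, not the paper's scheme.
    """
    languages = next(iter(scores.values())).keys()
    combined = {
        lang: sum(weights[rec] * scores[rec][lang] for rec in scores)
        for lang in languages
    }
    return max(combined, key=combined.get)


# Equal weights versus weights that favor the first recognizer.
equal = pprlm_decide(example_scores, {"es_phones": 0.5, "en_phones": 0.5})
skewed = pprlm_decide(example_scores, {"es_phones": 0.8, "en_phones": 0.2})
```

With these made-up numbers, equal weights select English and the skewed weights select Spanish, which is why the choice of weights is worth optimizing.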

Bibliographic reference.  Cordoba, R. / Prime, G. / Macias-Guarasa, J. / Montero, J.M. / Ferreiros, J. / Pardo, J.M. (2003): "PPRLM optimization for language identification in air traffic control tasks", In EUROSPEECH-2003, 2685-2688.