12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

The Albayzin 2010 Language Recognition Evaluation

Luis Javier Rodriguez-Fuentes, Mikel Penagarikano, Amparo Varona, Mireia Diez, Germán Bordel

Universidad del País Vasco, Spain

The Albayzin 2010 Language Recognition Evaluation (LRE), carried out from June to October 2010, was the second effort by the Spanish/Portuguese community to benchmark language recognition technology. Like the Albayzin 2008 LRE, it was coordinated by the Software Technology Working Group of the University of the Basque Country, with the support of the Spanish Thematic Network on Speech Technology. A speech database was created for system development and evaluation. Speech signals were recorded from TV broadcasts and included both clean and noisy speech. The task consisted of deciding whether or not a target language was spoken in a test utterance, and involved six target languages: English, Portuguese and the four official languages of Spain (Basque, Catalan, Galician and Spanish). Other (out-of-set) languages were also recorded to allow open-set verification tests. This paper presents the main features of the evaluation, analyses system performance under different conditions, including the confusion among languages, and offers suggestions for future evaluations.

Bibliographic reference. Rodriguez-Fuentes, Luis Javier / Penagarikano, Mikel / Varona, Amparo / Diez, Mireia / Bordel, Germán (2011): "The Albayzin 2010 language recognition evaluation", in INTERSPEECH-2011, 1529-1532.