Odyssey 2010: The Speaker and Language Recognition Workshop

Brno, Czech Republic
28 June – 1 July 2010

Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system

Zdenek Jancik, Oldrich Plchot (1), Niko Brümmer (2), Lukas Burget, Ondrej Glembek, Valiantsina Hubeika, Martin Karafiat, Pavel Matejka, Tomas Mikolov (1), Albert Strasheim (2), Jan "Honza" Cernocky (1)

(1) Brno University of Technology, (2) Agnitio

This paper summarizes the BUT-AGNITIO system for the NIST Language Recognition Evaluation (LRE) 2009. The post-evaluation analysis aimed mainly at improving the quality of the data (fixing language label problems and detecting overlapping speakers in the training and development sets) and at investigating different compositions of the development set. The paper further investigates the JFA-based acoustic system and reports results for new SVM-PCA systems that go beyond the original BUT-AGNITIO NIST LRE 2009 submission. All results are presented on the evaluation data from the NIST LRE 2009 task.

Bibliographic reference. Jancik, Zdenek / Plchot, Oldrich / Brümmer, Niko / Burget, Lukas / Glembek, Ondrej / Hubeika, Valiantsina / Karafiat, Martin / Matejka, Pavel / Mikolov, Tomas / Strasheim, Albert / Cernocky, Jan "Honza" (2010): "Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system", in Odyssey-2010, paper 037.