ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Anchor-model fusion for language recognition

Ignacio Lopez-Moreno, Daniel Ramos, Joaquin Gonzalez-Rodriguez, Doroteo T. Toledano

State-of-the-art language recognition systems usually combine multiple acoustic and phonotactic subsystems. The outputs of those systems are usually fused in different ways but the score from a trial is always obtained from N scores from N subsystems. In this paper, a robust novel approach to subsystem fusion in language recognition is proposed based on the relative performance of each trial not just to the claimed model but to all available models. The proposed technique exploits the relative behavior of a given speech utterance over the cohort of anchor models from the different subsystems, resulting in the proposed anchor-model fusion. Experiments fusing seven phone-SVM subsystems submitted by the authors to NIST LRE 2007 assess the robustness to non-uniform data availability over rule-based and trained fusion schemes as linear kernel SVM, as well as significant improvements in performance both in average EER and Cavg as used in NIST LRE.

doi: 10.21437/Interspeech.2008-227

Cite as: Lopez-Moreno, I., Ramos, D., Gonzalez-Rodriguez, J., Toledano, D.T. (2008) Anchor-model fusion for language recognition. Proc. Interspeech 2008, 727-730, doi: 10.21437/Interspeech.2008-227

  author={Ignacio Lopez-Moreno and Daniel Ramos and Joaquin Gonzalez-Rodriguez and Doroteo T. Toledano},
  title={{Anchor-model fusion for language recognition}},
  booktitle={Proc. Interspeech 2008},