We propose a novel design for acoustic feature-based automatic spoken language recognizers. Our design is inspired by recent advances in text-independent speaker recognition, where intraclass variability is modeled by factor analysis in Gaussian mixture model (GMM) space. We use approximations to GMM-likelihoods which allow variable-length data sequences to be represented as statistics of fixed size. Our experiments on NIST LRE’07 show that variability-compensation of these statistics can reduce error-rates by a factor of three. Finally, we show that further improvements are possible with discriminative logistic regression training.
Bibliographic reference. Brümmer, Niko / Strasheim, Albert / Hubeika, Valiantsina / Matějka, Pavel / Burget, Lukáš / Glembek, Ondřej (2009): "Discriminative acoustic language recognition via channel-compensated GMM statistics", In INTERSPEECH-2009, 2187-2190.