14th Annual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Mixtures of Bayesian Joint Factor Analyzers for Noise Robust Automatic Speech Recognition

Xiaodong Cui, Vaibhava Goel, Brian Kingsbury

IBM T.J. Watson Research Center, USA

This paper investigates a noise-robust approach to automatic speech recognition based on a mixture of Bayesian joint factor analyzers. In this approach, noisy features are modeled jointly by two groups of factors that account for speaker and noise variability and are estimated from clean and noisy speech, respectively. Together, the factors form an overcomplete dictionary that gives a redundant representation. Automatic relevance determination (ARD) is carried out with the relevance vector machine (RVM), in which sparsity-promoting priors are placed on the two factor loading matrices. Experiments on large vocabulary continuous speech recognition (LVCSR) tasks show that this approach yields good improvements.
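The abstract does not give the model equations, so the following is only an illustrative sketch of what one mixture component of a joint factor analyzer with ARD-style priors might look like, not the authors' formulation. It assumes standard-normal speaker and noise factors, a diagonal residual variance, and a zero-mean Gaussian prior on each loading-matrix column with its own precision. All function and parameter names are hypothetical, and the RVM-based inference itself is not shown.

```python
import numpy as np


def sample_ard_loading(dim, column_precisions, rng):
    """Draw a loading matrix whose k-th column is N(0, I / alpha_k).

    Assumed ARD-style prior: a large precision alpha_k shrinks column k
    toward zero, which effectively prunes that factor.
    """
    scales = 1.0 / np.sqrt(np.asarray(column_precisions, dtype=float))
    return rng.standard_normal((dim, scales.size)) * scales


def sample_joint_factor_analyzer(mean, speaker_loading, noise_loading,
                                 residual_var, n_samples, rng):
    """Sample y = mean + A x + B z + e for a single (assumed) component.

    A, B are the speaker and noise loading matrices; x, z are
    standard-normal factors; e is diagonal Gaussian residual noise.
    """
    dim, n_speaker = speaker_loading.shape
    n_noise = noise_loading.shape[1]
    x = rng.standard_normal((n_samples, n_speaker))  # speaker factors
    z = rng.standard_normal((n_samples, n_noise))    # noise factors
    e = rng.standard_normal((n_samples, dim)) * np.sqrt(residual_var)
    return mean + x @ speaker_loading.T + z @ noise_loading.T + e


def marginal_covariance(speaker_loading, noise_loading, residual_var):
    """Covariance of y after integrating out both groups of factors."""
    return (speaker_loading @ speaker_loading.T
            + noise_loading @ noise_loading.T
            + np.diag(residual_var))
```

In this sketch, concatenating the two loading matrices gives a dictionary with more columns than the feature dimension whenever the total number of factors exceeds it, which is one way to read the "overcomplete" description above; the per-column precisions are what a sparsity-promoting prior would drive up for irrelevant factors.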


Bibliographic reference. Cui, Xiaodong / Goel, Vaibhava / Kingsbury, Brian (2013): "Mixtures of Bayesian joint factor analyzers for noise robust automatic speech recognition", in INTERSPEECH-2013, 3012-3016.