13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Sparse Bayesian Factor Analysis for Stereo-based Stochastic Mapping

Xiaodong Cui (1), Mohamed Afify (2), George Saon (1), Vaibhava Goel (1)

(1) IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
(2) Orange Lab, Smart Village, Cairo, Egypt

This paper investigates a factor analysis scheme in the joint channel space of stereo-based stochastic mapping (SSM) for noise robust automatic speech recognition. A mixture of Bayesian factor analyzers is used to describe the generative factors in the multi-conditional training scenario in terms of noise type and signal-to-noise ratio. Sparsity-promoting prior is applied on the matrix of factor loadings to automatically learn the effective factors from a redundant dictionary in a particular soft cluster. Experiments carried out on large vocabulary continuous speech recognition tasks show that this sparse Bayesian factor analysis scheme leads to superior SSM performance for noise robustness.

Index Terms: Bayesian factor analysis, sparsity learning, stereo-based stochastic mapping, noise robust automatic speech recognition

Full Paper

Bibliographic reference.  Cui, Xiaodong / Afify, Mohamed / Saon, George / Goel, Vaibhava (2012): "Sparse Bayesian factor analysis for stereo-based stochastic mapping", In INTERSPEECH-2012, 795-798.