8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Online Minimum Mean Square Error Filtering of Noisy Cepstral Coefficients Using a Sequential EM Algorithm

Tor Andre Myrvoll (1), Satoshi Nakamura (2)

(1) Norwegian University of Science and Technology, Norway
(2) Advanced Telecommunications Research Institute International, Japan

In this work we propose an online filtering algorithm that aims to alleviate the decrease we see in ASR performance when the speech is corrupted by additive noise. Using an initial estimate of the noise distribution, the algorithm updates the noise model on a frame synchronous basis. The minimum mean square error (MMSE) filtering is also performed at a frame per frame basis, using the most current noise model estimate at all times. The algorithm is compared to a batch version which uses several iterations of the EM-algorithm over the complete utterance to estimate the noise model, and it is demonstrated that the performance is as good or better at a fraction of the computational complexity when the noise is non-stationary.

Full Paper

Bibliographic reference.  Myrvoll, Tor Andre / Nakamura, Satoshi (2004): "Online minimum mean square error filtering of noisy cepstral coefficients using a sequential EM algorithm", In INTERSPEECH-2004, 117-120.