Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Residual Noise Compensation by a Sequential EM Algorithm for Robust Speech Recognition in Nonstationary Noise

Kaisheng Yao (1), Bertram E. Shi (2), Satoshi Nakamura (1), Zhigang Cao (3)

(1) ATR Spoken Language Translation Research Laboratories, Kyoto, Japan
(2) Department of Electrical and Electronic Engineering, Hong Kong University of Science and Technology, Hong Kong
(3) Department of Electronic Engineering, Tsinghua University, Beijing, China

We model noise as a stationary component plus a time-varying residual. The stationary component is estimated off-line and compensated using Log-Add noise compensation. The time-varying residual is estimated and compensated using a sequential EM algorithm, which runs in parallel with the recognition process. Experimental results demonstrate that, compared with Log-Add noise compensation alone, the proposed algorithm improves recognition performance in both highly nonstationary noise and slowly varying noise.
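
As a rough illustration of the Log-Add step mentioned in the abstract, the sketch below combines a clean-speech mean and a noise estimate in the log-spectral domain as log(exp(s) + exp(n)). It assumes per-channel log-spectral values; the function names `log_add` and `compensate_means` are hypothetical and are not taken from the paper. The sequential EM estimation of the time-varying residual is not shown.

```python
import math


def log_add(a, b):
    """Return log(exp(a) + exp(b)), computed in a numerically stable way."""
    hi, lo = (a, b) if a >= b else (b, a)
    # Factor out the larger term so the exponential cannot overflow.
    return hi + math.log1p(math.exp(lo - hi))


def compensate_means(speech_log_means, noise_log_means):
    """Apply the log-add combination channel by channel.

    Assumption: both inputs are sequences of log-spectral values of
    equal length (one entry per filter-bank channel). This is an
    illustrative sketch, not the authors' implementation.
    """
    if len(speech_log_means) != len(noise_log_means):
        raise ValueError("speech and noise vectors must have equal length")
    return [log_add(s, n) for s, n in zip(speech_log_means, noise_log_means)]


if __name__ == "__main__":
    clean = [2.0, 5.0, 1.0]   # hypothetical clean-speech log-spectral means
    noise = [1.0, 1.0, 3.0]   # hypothetical off-line stationary noise estimate
    print(compensate_means(clean, noise))
```

Under this combination, a channel where speech dominates stays close to the clean-speech mean, while a channel where noise dominates moves toward the noise estimate. Re-running the combination with an updated noise estimate at each frame is one way such a scheme could follow time-varying noise.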


Bibliographic reference.  Yao, Kaisheng / Shi, Bertram E. / Nakamura, Satoshi / Cao, Zhigang (2000): "Residual noise compensation by a sequential EM algorithm for robust speech recognition in nonstationary noise", In ICSLP-2000, vol.1, 770-773.