11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Single-Channel Speech Enhancement Using Kalman Filtering in the Modulation Domain

Stephen So, Kamil K. Wójcicki, Kuldip K. Paliwal

Griffith University, Australia

In this paper, we propose the modulation-domain Kalman filter (MDKF) for speech enhancement. In contrast to previous modulation domain-enhancement methods based on bandpass filtering, the MDKF is an adaptive and linear MMSE estimator that uses models of the temporal changes of the magnitude spectrum for both speech and noise. Also, because the Kalman filter is a joint magnitude and phase spectrum estimator, under non-stationarity assumptions, it is highly suited for modulation-domain processing, as modulation phase tends to contain more speech information than acoustic phase. Experimental results from the NOIZEUS corpus show the ideal MDKF (with clean speech parameters) to outperform all the acoustic and time-domain enhancement methods that were evaluated, including the conventional time-domain Kalman filter with clean speech parameters. A practical MDKF that uses the MMSE-STSA method to enhance noisy speech in the acoustic domain prior to LPC analysis was also evaluated and showed promising results.

Full Paper

Bibliographic reference.  So, Stephen / Wójcicki, Kamil K. / Paliwal, Kuldip K. (2010): "Single-channel speech enhancement using kalman filtering in the modulation domain", In INTERSPEECH-2010, 993-996.