8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

In-phase Feature Induction: An Effective Compensation Technique for Robust Speech Recognition

Siu Wa Lee, Pak Chung Ching

The Chinese University of Hong Kong, Hong Kong

The performance of most standard Automatic Speech Recognition (ASR) systems degrades severely under noisy environments, because they are usually trained with clean speech data. There exist several speech enhancement or compensation schemes that can improve the robustness of ASR to different levels. In this paper, an effective feature compensation method called In-phase Feature Induction (IFI) is proposed. It makes use of the phase relationship between the spectra of noisy input and the corresponding noise signal to accurately obtain the clean speech spectrum. Experimental results show that recognition systems with IFI compensation yield much higher accuracy rates under various noisy conditions compared with the Spectral Subtraction based method.

Full Paper

Bibliographic reference.  Lee, Siu Wa / Ching, Pak Chung (2004): "In-phase feature induction: an effective compensation technique for robust speech recognition", In INTERSPEECH-2004, 157-160.