8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003


A Segment-Based Algorithm of Speech Enhancement for Robust Speech Recognition

Guokang Fu (1), Ta-Hsin Li (2)

(1) IBM China Research Lab, China
(2) IBM T.J. Watson Research Center, USA

Accurate recognition of speech in noisy environments remains an obstacle to the wider application of speech recognition technology. Noise reduction, which aims to clean the corrupted test signal so that it matches the ideal training conditions, remains an effective approach to improving the accuracy of speech recognition in noisy environments. This paper introduces a new noise reduction algorithm that combines a tree-based segmentation method with maximum likelihood estimation to accommodate the nonstationarity of speech while efficiently suppressing possibly nonstationary noise. Numerical results from experiments on a speech recognition system show the effectiveness of the proposed algorithm in improving the accuracy of Chinese speech recognition.

Bibliographic reference.  Fu, Guokang / Li, Ta-Hsin (2003): "A segment-based algorithm of speech enhancement for robust speech recognition", In EUROSPEECH-2003, 3029-3032.