Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Optimized Subspace Weighting for Robust Speech Recognition in Additive Noise Environments

Kris Hermus, Werner Verhelst, Patrick Wambacq

Katholieke Universiteit Leuven - ESAT/PSI, Leuven, Belgium

Signal Subspace (SS) based speech enhancement techniques obtain significant additive-noise reduction by altering the singular value spectrum of the speech observation matrix. Among the possible SS weighting strategies, the Minimum Variance (MV) estimation method substantially increases speech recognition accuracy in additive noise environments, outperforming the widely used Spectral Subtraction methods. However, these SS approaches were developed as pure speech enhancement techniques, and it is still unknown how effective they are for noise-robust speech recognition. In this respect, we present the idea of 'optimal SS weighting' for speech recognition systems, and we illustrate in detail that the MV estimation closely approximates this optimum. We applied the SS weighting methods to an LV-CSR task with noisy data (10 dB SNR), and obtained relative reductions in Word Error Rate of more than 60%.
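
The weighting step described in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so the code below assumes the white-noise form of the MV weights commonly found in the subspace enhancement literature, w_i = 1 - (noise_sv / s_i)^2, where s_i is a singular value of the noisy observation matrix and noise_sv is the assumed noise singular-value level. The function names, the example values, and the least-squares (LS) truncation used for comparison are illustrative assumptions, not the authors' implementation. Computing the SVD and rebuilding the enhanced signal are left out.

```python
def mv_weights(singular_values, noise_sv):
    """Assumed MV weights: w_i = 1 - (noise_sv / s_i)^2, clipped at 0.

    Components whose singular value does not exceed the noise level
    are treated as pure noise and discarded (weight 0).
    """
    weights = []
    for s in singular_values:
        if s <= noise_sv:
            weights.append(0.0)
        else:
            weights.append(1.0 - (noise_sv / s) ** 2)
    return weights


def ls_weights(singular_values, noise_sv):
    """Least-squares truncation: keep signal components unchanged, drop the rest."""
    return [1.0 if s > noise_sv else 0.0 for s in singular_values]


def apply_weights(singular_values, weights):
    """Scale each singular value by its weight."""
    return [w * s for w, s in zip(weights, singular_values)]


# Hypothetical singular values of a noisy observation matrix.
noisy_sv = [10.0, 5.0, 2.0, 1.0]
noise_sv = 1.0  # assumed known, e.g. estimated from non-speech frames

mv = mv_weights(noisy_sv, noise_sv)
ls = ls_weights(noisy_sv, noise_sv)
mv_spectrum = apply_weights(noisy_sv, mv)
ls_spectrum = apply_weights(noisy_sv, ls)
```

Under these assumptions, strong components pass almost unchanged while components close to the noise level are attenuated progressively. LS truncation, by contrast, applies a hard keep-or-drop decision.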

Bibliographic reference. Hermus, Kris / Verhelst, Werner / Wambacq, Patrick (2000): "Optimized subspace weighting for robust speech recognition in additive noise environments", in ICSLP-2000, vol. 3, 542-545.