7th International Conference on Spoken Language Processing
September 16-20, 2002
We present a highly effective and extremely simple noise-robust front end based on novel post-processing of standard MFCC features. It performs remarkably well on the Aurora 2.0 noisy-digits database without requiring any increase in model complexity. Compared to the Aurora 2.0 baseline system, our technique improves the average word error rate by 45% in the multi-condition training case, (matched training/testing conditions) and 60% in the clean training case (mismatched training/testing conditions) - this is an improvement that rivals some of the best known results on this database. Our method, moreover, improves the performances in all cases, regardless of clean or noisy speech, matched or mismatched environments. Our technique is entirely general because it makes no assumptions about the existence, type, or level of noise in the speech signal. Moreover, its simplicity means that it should be easy to integrate with other techniques in order to yield further improvements.
Bibliographic reference. Chen, Chia-Ping / Bilmes, Jeff A. / Kirchhoff, Katrin (2002): "Low-resource noise-robust feature post-processing on Aurora 2.0", In ICSLP-2002, 2445-2448.