Sixth European Conference on Speech Communication and Technology
Using TI digits recognition experiments, we show that a combination of two dynamic speech features, Liftered Forward Masked (LFM) MFCC and 2-D cepstrum, can improve system robustness to additive Volvo noise while maintaining performance comparable to standard MFCC features in clean conditions. Through experiments, we show that the information extracted by forward masking and the information extracted by the 2-D cepstrum are in some sense orthogonal. By combining the LFM MFCC and the 2-D cepstrum plus > 2-D cepstrum, we achieve a recognition rate above 90% on the TI connected digits task, even in additive Volvo noise conditions with an SNR as low as 0 dB. This corresponds to an SNR gain of over 30 dB compared with standard MFCC plus dynamic and acceleration coefficients.
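The noise conditions in the abstract are stated as SNR levels in dB. As a rough illustration of what a condition such as 0 dB means, the sketch below scales a noise signal so that the speech-to-noise power ratio matches a target SNR before adding it to the speech. This is not the authors' procedure: the function names `mean_power`, `mix_at_snr`, and `measure_snr_db` are hypothetical, and the sine wave and Gaussian noise are synthetic stand-ins for TI digits utterances and Volvo noise.

```python
import math
import random


def mean_power(x):
    """Average power (mean of squared samples) of a sequence."""
    return sum(v * v for v in x) / len(x)


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so that the speech-to-noise power ratio equals
    `snr_db`, then add it to `speech`.

    Returns (noisy_signal, scaled_noise). Hypothetical helper, not taken
    from the paper.
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[: len(speech)]
    # Target: 10 * log10(P_speech / (gain^2 * P_noise)) == snr_db
    gain = math.sqrt(
        mean_power(speech) / (mean_power(noise) * 10 ** (snr_db / 10))
    )
    scaled = [gain * n for n in noise]
    noisy = [s + n for s, n in zip(speech, scaled)]
    return noisy, scaled


def measure_snr_db(signal, noise):
    """SNR in dB between a clean signal and the noise added to it."""
    return 10 * math.log10(mean_power(signal) / mean_power(noise))


# Synthetic stand-ins (assumptions): a sine wave for speech,
# Gaussian noise in place of recorded Volvo noise.
rng = random.Random(0)
speech = [math.sin(0.1 * i) for i in range(8000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(8000)]

noisy_0db, scaled_0db = mix_at_snr(speech, noise, 0.0)
```

At 0 dB the scaled noise carries the same average power as the speech, so a 30 dB SNR gain means the combined features tolerate roughly 1000 times more noise power than the baseline at a matched recognition rate.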
Bibliographic reference. Yao, Kaisheng / Shi, Bertram / Fung, Pascale / Cao, Zhigang (1999): "Liftered forward masking procedure for robust digits recognition", In EUROSPEECH'99, 2873-2876.