Sixth European Conference on Speech Communication and Technology
(EUROSPEECH'99)

Budapest, Hungary
September 5-9, 1999

Liftered Forward Masking Procedure for Robust Digits Recognition

Kaisheng Yao (1,2), Bertram Shi (2), Pascale Fung (2), Zhigang Cao (1)

(1) Department of Electronic Engineering, Tsinghua University, Beijing, China
(2) Human Language Technology Center, Department of Electrical and Electronic Engineering, Hong Kong University of Science and Technology (HKUST), Clear Water Bay, Hong Kong

Using TI digits recognition experiments, we show that a combination of two dynamic speech features, Liftered Forward Masked (LFM) MFCC and the 2-D cepstrum, can improve system robustness to additive Volvo noise while maintaining performance comparable to standard MFCC features in clean conditions. Through experiments, we show that the information extracted by forward masking and the information extracted by the 2-D cepstrum are in some sense orthogonal. By combining the LFM MFCC and the 2-D cepstrum plus > 2-D cepstrum, we achieve a recognition rate above 90% on the TI connected digits task, even in additive Volvo noise at an SNR as low as 0 dB. This corresponds to an SNR gain of over 30 dB compared with standard MFCC plus dynamic and acceleration coefficients.



Bibliographic reference. Yao, Kaisheng / Shi, Bertram / Fung, Pascale / Cao, Zhigang (1999): "Liftered forward masking procedure for robust digits recognition", in EUROSPEECH'99, 2873-2876.