4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
This paper presents the use of a variety of filters in the temporal trajectories of frequency band spectrum to extract speech recognition features for environmental robustness. Three kind of filters for emphasizing the statistically important parts of speech are proposed. First, a bank of RASTA-like band-pass filters to fit the statistical peaks of modulation frequency band spectrum of speech are used. Secondly, a three-channel octave band-filter band with a smoothed rectangular window spline is applied. Thirdly, a data- driven filter is developed. Experimental results show that significant improvements for speech recognition using the proposed feature extraction approach under noisy environments can be achieved.
Bibliographic reference. Shen, Jia-lin / Hwang, Wen-liang / Lee, Lin-shan (1996): "Robust speech recognition features based on temporal trajectory filtering of frequency band spectrum", In ICSLP-1996, 881-884.