ISCA Archive Interspeech 2005

A generalized framework for compensation of mel-filterbank outputs in feature extraction for robust ASR

Eric H. C. Choi

This paper describes a novel and efficient noise-robust front-end for noisy speech recognition that combines a set of Mel-filterbank output compensation methods with cumulative distribution mapping of cepstral coefficients. The proposed compensation framework comprises noise spectral subtraction, spectral flooring and log Mel-filterbank output weighting. Recognition experiments on the Aurora II connected digit database show that the proposed front-end achieves an average digit recognition accuracy of 83.46% for a model set trained on clean data. Compared with the results obtained using the ETSI standard Mel-cepstral front-end, this represents a relative error reduction of around 58%.
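
The abstract names three compensation steps applied to the Mel-filterbank outputs: noise spectral subtraction, spectral flooring and log-output weighting. The following is a minimal illustrative sketch of how such a chain is commonly written, not the paper's actual formulation. The function name `compensate_mel_outputs`, the parameters `alpha` (over-subtraction factor) and `beta` (floor factor), the choice of flooring relative to the noisy energy, and the per-band `weights` are all assumptions made for illustration; the abstract does not specify any of them.

```python
import numpy as np

def compensate_mel_outputs(mel_energies, noise_estimate,
                           alpha=1.0, beta=0.01, weights=None):
    """Hypothetical compensation chain for Mel-filterbank outputs.

    mel_energies   : array (frames, bands), non-negative filterbank energies
    noise_estimate : array (bands,), estimated noise energy per band
    alpha, beta    : assumed over-subtraction and flooring factors
    weights        : assumed per-band weights applied in the log domain
    """
    mel_energies = np.asarray(mel_energies, dtype=float)
    noise_estimate = np.asarray(noise_estimate, dtype=float)

    # 1. Noise spectral subtraction in the filterbank energy domain.
    subtracted = mel_energies - alpha * noise_estimate

    # 2. Spectral flooring: never drop below a fraction of the noisy energy
    #    (assumption; flooring relative to the noise estimate is also common).
    floored = np.maximum(subtracted, beta * mel_energies)

    # 3. Log compression, guarded against log(0), then per-band weighting.
    log_mel = np.log(np.maximum(floored, 1e-10))
    if weights is None:
        weights = np.ones(mel_energies.shape[1])
    return log_mel * np.asarray(weights, dtype=float)
```

In a full front-end the weighted log outputs would then go through a DCT to produce cepstral coefficients, to which the cumulative distribution mapping mentioned in the abstract would be applied; that stage is not sketched here.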


doi: 10.21437/Interspeech.2005-222

Cite as: Choi, E.H.C. (2005) A generalized framework for compensation of mel-filterbank outputs in feature extraction for robust ASR. Proc. Interspeech 2005, 933-936, doi: 10.21437/Interspeech.2005-222

@inproceedings{choi05b_interspeech,
  author={Eric H. C. Choi},
  title={{A generalized framework for compensation of mel-filterbank outputs in feature extraction for robust ASR}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={933--936},
  doi={10.21437/Interspeech.2005-222}
}