ISCA Archive Eurospeech 2001

Acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments

Hong Kook Kim, Richard C. Rose, Hong-Goo Kang

This paper presents a set of acoustic feature pre-processing techniques that are applied to improve automatic speech recognition (ASR) performance on the Aurora 2 noisy speech recognition task. The principal contribution of this paper is an approach to cepstrum-domain feature compensation in ASR, motivated by techniques for decomposing speech and noise that were originally developed for noisy speech enhancement. This approach is applied in combination with other feature compensation algorithms to compensate ASR features obtained from a mel-filterbank cepstrum coefficient (MFCC) front-end. Its performance is compared with that obtained by applying the minimum mean squared error log spectral amplitude estimator (MMSELSA) based speech enhancement algorithm prior to feature analysis. In an experimental study, the feature compensation approaches described in the paper are found to reduce ASR word error rate by as much as 31% relative to uncompensated features under simulated environmental and channel mismatched conditions.
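
The 31% figure above is a relative reduction in word error rate, not a difference in absolute percentage points. As a minimal sketch of that arithmetic, the helper below computes a relative reduction from two word error rates. The function name `relative_wer_reduction` and the two input values are hypothetical and chosen only for illustration; they are not results reported in the paper.

```python
def relative_wer_reduction(baseline_wer: float, compensated_wer: float) -> float:
    """Return the relative word error rate reduction as a fraction.

    Both arguments are word error rates on the same scale (for example,
    percent). The baseline is the system with uncompensated features.
    """
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - compensated_wer) / baseline_wer


# Hypothetical numbers for illustration only (not taken from the paper):
# a baseline WER of 20.0% and a compensated WER of 13.8%.
reduction = relative_wer_reduction(20.0, 13.8)
print(f"Relative WER reduction: {reduction:.0%}")
```

With these illustrative inputs, an absolute drop of 6.2 percentage points corresponds to a relative reduction of about 31%.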


doi: 10.21437/Eurospeech.2001-113

Cite as: Kim, H.K., Rose, R.C., Kang, H.-G. (2001) Acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 421-424, doi: 10.21437/Eurospeech.2001-113

@inproceedings{kim01b_eurospeech,
  author={Hong Kook Kim and Richard C. Rose and Hong-Goo Kang},
  title={{Acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={421--424},
  doi={10.21437/Eurospeech.2001-113}
}