We have evaluated several feature-based and a model-based method for robust speech recognition in noise. The evaluation was performed on Aurora 2 task. We show that after a sub-band based spectral subtraction, features can be more robust to additive noise. We also report a robust feature set derived from differential power spectrum (DPS), which is not only robust to additive noise, but also robust to spectrum colorization due to channel effects. When the clean training set is available, we show that a model-based noise compensation method can be effective to improve system robustness to noise. Given the testing sets, as a whole, the feature-based methods can yield about 22% relative improvement in accuracy for multi-condition training task, and the model-based method can have about 63% relative performance improvement when systems were trained on clean training set.
Cite as: Yao, K., Chen, J., Paliwal, K.K., Nakamura, S. (2001) Feature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 task. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 233-236, doi: 10.21437/Eurospeech.2001-81
@inproceedings{yao01_eurospeech, author={Kaisheng Yao and Jingdong Chen and Kuldip K. Paliwal and Satoshi Nakamura}, title={{Feature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 task}}, year=2001, booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)}, pages={233--236}, doi={10.21437/Eurospeech.2001-81} }