ISCA Archive Interspeech 2008

Analysis of physiologically-motivated signal processing for robust speech recognition

Yu-Hsiang Bosco Chiu, Richard M. Stern

This paper discusses the relative impact that different stages of a popular auditory model have on improving the accuracy of automatic speech recognition in the presence of additive noise. Recognition accuracy is measured using the CMU SPHINX-III speech recognition system, with the DARPA Resource Management speech corpus used for training and testing. It is shown that feature extraction based on auditory processing provides better performance in the presence of additive background noise than traditional MFCC processing, and it is argued that an expansive nonlinearity in the auditory model contributes the most to noise robustness.


doi: 10.21437/Interspeech.2008-291

Cite as: Chiu, Y.-H.B., Stern, R.M. (2008) Analysis of physiologically-motivated signal processing for robust speech recognition. Proc. Interspeech 2008, 1000-1003, doi: 10.21437/Interspeech.2008-291

@inproceedings{chiu08_interspeech,
  author={Yu-Hsiang Bosco Chiu and Richard M. Stern},
  title={{Analysis of physiologically-motivated signal processing for robust speech recognition}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1000--1003},
  doi={10.21437/Interspeech.2008-291}
}