ISCA Archive Eurospeech 2001

Auditory model based speech recognition in noisy environment

Xiaoqing Yu, Wanggen Wan, Daniel P. K. Lun

This paper presents a new speech feature, ASBF, based on a mathematical model of the inner ear of the human auditory system. The feature is extracted using both the inner-ear model and a model of primary auditory nerve processing, and it tracks speech formants effectively. In the experiments, the performance of MFCC and ASBF is compared in both clean and noisy environments using a left-to-right CDHMM with 6 states and 5 Gaussian mixtures. The experimental results show that ASBF is much more robust to noise than MFCC. When only 5 dimensions are used in the ASBF vector, the recognition rate is approximately 38.6% higher than that of the traditional 39-dimensional MFCC at S/N = 10 dB with white noise.
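The noisy test condition quoted above (white noise at S/N = 10 dB) can be illustrated with a minimal sketch of how such a mixture is commonly prepared. This is not the authors' procedure, which the abstract does not describe. The function names `add_white_noise` and `measure_snr_db`, the Gaussian noise source, and the 440 Hz test tone sampled at 8 kHz are all assumptions made for illustration.

```python
import math
import random


def add_white_noise(signal, snr_db, rng=None):
    """Return signal plus white Gaussian noise at the requested SNR in dB.

    Hypothetical helper: assumes a non-empty list of float samples.
    """
    rng = rng or random.Random()
    signal_power = sum(x * x for x in signal) / len(signal)
    # SNR(dB) = 10 * log10(P_signal / P_noise), so P_noise = P_signal / 10^(SNR/10)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    # Rescale this particular noise draw so its measured power hits the target.
    measured_power = sum(n * n for n in noise) / len(noise)
    scale = math.sqrt(noise_power / measured_power)
    return [x + scale * n for x, n in zip(signal, noise)]


def measure_snr_db(clean, noisy):
    """Measure the SNR in dB of a noisy signal against its clean reference."""
    signal_power = sum(x * x for x in clean) / len(clean)
    noise_power = sum((y - x) ** 2 for x, y in zip(clean, noisy)) / len(clean)
    return 10.0 * math.log10(signal_power / noise_power)


# Assumed toy input: one second of a 440 Hz tone sampled at 8 kHz.
clean = [math.sin(2.0 * math.pi * 440.0 * t / 8000.0) for t in range(8000)]
noisy = add_white_noise(clean, 10.0, random.Random(0))
```

Calling `measure_snr_db(clean, noisy)` on the result should return a value very close to 10, since the noise is rescaled to the target power after it is drawn.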


doi: 10.21437/Eurospeech.2001-162

Cite as: Yu, X., Wan, W., Lun, D.P.K. (2001) Auditory model based speech recognition in noisy environment. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 611-614, doi: 10.21437/Eurospeech.2001-162

@inproceedings{yu01_eurospeech,
  author={Xiaoqing Yu and Wanggen Wan and Daniel P. K. Lun},
  title={{Auditory model based speech recognition in noisy environment}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={611--614},
  doi={10.21437/Eurospeech.2001-162}
}