ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Auditory model based optimization of MFCCs improves automatic speech recognition performance

Saikat Chatterjee, Christos Koniaris, W. Bastiaan Kleijn

Using a spectral auditory model along with perturbation based analysis, we develop a new framework to optimize a set of features such that it emulates the behavior of the human auditory system. The optimization is carried out in an off-line manner based on the conjecture that the local geometries of the feature domain and the perceptual auditory domain should be similar. Using this principle, we modify and optimize the static mel frequency cepstral coefficients (MFCCs) without considering any feedback from the speech recognition system. We show that improved recognition performance is obtained for any environmental condition, clean as well as noisy.


doi: 10.21437/Interspeech.2009-756

Cite as: Chatterjee, S., Koniaris, C., Kleijn, W.B. (2009) Auditory model based optimization of MFCCs improves automatic speech recognition performance. Proc. Interspeech 2009, 2987-2990, doi: 10.21437/Interspeech.2009-756

@inproceedings{chatterjee09_interspeech,
  author={Saikat Chatterjee and Christos Koniaris and W. Bastiaan Kleijn},
  title={{Auditory model based optimization of MFCCs improves automatic speech recognition performance}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2987--2990},
  doi={10.21437/Interspeech.2009-756}
}