ISCA Archive Eurospeech 1999

EARLYZER: perceptualy motivated robust TFR of speech

J. V. Avadhanulu, M. Mathew, T. V. Sreenivas

Developing a robust and efficient front end is crucial for robust automatic speech recognition (ASR). Proper time and frequency resolution of the time-frequency representation (TFR) of speech, motivated by auditory models, is considered an important factor for robustness. An efficient method of realizing a variable-resolution TFR using the DTFT/Goertzel algorithm is proposed in place of the standard FFT-based approach. The new representation, called EarLyzer, is shown to be more robust than the FFT-based Mel-frequency cepstral coefficient representation on a noisy automobile speech recognition task.
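
To illustrate the general idea named in the abstract, the following is a minimal sketch of how the Goertzel recursion can evaluate the DTFT power at a single arbitrary frequency, and how using a different analysis window length per frequency gives a variable-resolution TFR. It is not the authors' implementation: the function names, the `bands` list of (frequency, window length) pairs, and the choice of a rectangular window are all assumptions made for illustration only.

```python
import math


def goertzel_power(frame, freq_hz, sample_rate):
    """Return |X(w)|^2, the squared DTFT magnitude of `frame` at freq_hz.

    Uses the second-order Goertzel recursion, so the frequency does not
    have to fall on an FFT bin. (Hypothetical helper, not from the paper.)
    """
    w = 2.0 * math.pi * freq_hz / sample_rate  # radians per sample
    coeff = 2.0 * math.cos(w)
    s1 = 0.0  # state s[n-1]
    s2 = 0.0  # state s[n-2]
    for x in frame:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    # Squared magnitude of s[N-1] - exp(-jw) * s[N-2]
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


def variable_resolution_spectrum(signal, centre, sample_rate, bands):
    """Evaluate one spectral slice with a per-frequency window length.

    `bands` is an assumed list of (freq_hz, window_len) pairs. Longer
    windows give finer frequency resolution; shorter windows give finer
    time resolution. Each window is centred on sample index `centre`.
    """
    powers = []
    for freq_hz, window_len in bands:
        start = max(0, centre - window_len // 2)
        frame = signal[start:start + window_len]
        powers.append(goertzel_power(frame, freq_hz, sample_rate))
    return powers


if __name__ == "__main__":
    fs = 8000
    tone = [math.sin(2.0 * math.pi * 1000.0 * n / fs) for n in range(400)]
    # Assumed example layout: long window at low frequency, short at high.
    bands = [(500.0, 160), (1000.0, 80)]
    print(variable_resolution_spectrum(tone, 200, fs, bands))
```

Because each frequency is computed independently, the cost scales with the number of analysis frequencies times their window lengths, which is the usual reason to prefer Goertzel over a full FFT when only a modest number of non-uniformly spaced frequencies is needed.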


doi: 10.21437/Eurospeech.1999-609

Cite as: Avadhanulu, J.V., Mathew, M., Sreenivas, T.V. (1999) EARLYZER: perceptualy motivated robust TFR of speech. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2765-2768, doi: 10.21437/Eurospeech.1999-609

@inproceedings{avadhanulu99_eurospeech,
  author={J. V. Avadhanulu and M. Mathew and T. V. Sreenivas},
  title={{EARLYZER: perceptualy motivated robust TFR of speech}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={2765--2768},
  doi={10.21437/Eurospeech.1999-609}
}