ISCA Archive Interspeech 2005

Multi-resolution RASTA filtering for TANDEM-based ASR

Hynek Hermansky, Petr Fousek

A new speech representation based on multiple filtering of the temporal trajectories of speech energies in frequency sub-bands is proposed and tested. The technique extends earlier work on delta features and RASTA filtering by processing the temporal trajectories with a bank of band-pass filters of varying resolutions. In initial tests on the OGI Digits database, the technique yields about 30% relative improvement in word error rate over conventional PLP features. Since the applied filters have zero-mean impulse responses, the technique is inherently robust to linear distortions.
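The filtering idea can be illustrated with a minimal sketch. It is not the authors' implementation: the filter shapes (first and second derivatives of Gaussians), the widths in frames, the filter length, and the names `gaussian_derivative_bank` and `filter_trajectory` are all assumptions made for illustration. The sketch also assumes the trajectory holds log sub-band energies, so that a fixed linear (convolutive) distortion appears as a constant offset, which zero-mean filters remove.

```python
import numpy as np

def gaussian_derivative_bank(sigmas, half_len=50):
    """Build zero-mean FIR filters at several temporal resolutions.

    Assumption: each resolution is given by a Gaussian width `s` (in frames),
    and contributes a first- and a second-derivative-of-Gaussian filter.
    """
    t = np.arange(-half_len, half_len + 1, dtype=float)
    bank = []
    for s in sigmas:
        g = np.exp(-0.5 * (t / s) ** 2)
        d1 = -t / s**2 * g                     # antisymmetric, so it sums to zero
        d2 = (t**2 / s**4 - 1.0 / s**2) * g    # symmetric band-pass shape
        d2 -= d2.mean()                        # remove residual DC left by truncation
        for h in (d1, d2):
            bank.append(h / np.abs(h).max())   # scale each filter to unit peak
    return np.stack(bank)

def filter_trajectory(traj, bank):
    """Filter one sub-band trajectory with every filter in the bank.

    mode="valid" keeps only samples where the filter fully overlaps the
    trajectory, so no zero-padding edge effects enter the output.
    """
    return np.stack([np.convolve(traj, h, mode="valid") for h in bank])

# Hypothetical usage: one synthetic log-energy trajectory of 300 frames.
rng = np.random.default_rng(0)
traj = rng.standard_normal(300)
bank = gaussian_derivative_bank([1.0, 2.0, 4.0, 8.0])
clean = filter_trajectory(traj, bank)
shifted = filter_trajectory(traj + 5.0, bank)  # constant offset mimics a fixed channel
```

Under these assumptions, `clean` and `shifted` agree up to floating-point error, which is the robustness property the abstract attributes to zero-mean impulse responses.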

doi: 10.21437/Interspeech.2005-184

Cite as: Hermansky, H., Fousek, P. (2005) Multi-resolution RASTA filtering for TANDEM-based ASR. Proc. Interspeech 2005, 361-364, doi: 10.21437/Interspeech.2005-184

  author={Hynek Hermansky and Petr Fousek},
  title={{Multi-resolution RASTA filtering for TANDEM-based ASR}},
  booktitle={Proc. Interspeech 2005},