Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Multi-Resolution RASTA Filtering for TANDEM-Based ASR

Hynek Hermansky (1), Petr Fousek (2)

(1) IDIAP Research Institute, Switzerland; (2) Czech Technical University in Prague, Czech Republic

New speech representation based on multiple filtering of temporal trajectories of speech energies in frequency sub-bands is proposed and tested. The technique extends earlier works on delta features and RASTA filtering by processing temporal trajectories by a bank of band-pass filters with varying resolutions. In initial tests on OGI Digits database the technique yields about 30% relative improvement in word error rate over the conventional PLP features. Since the applied filters have zero-mean impulse responses, the technique is inherently robust to linear distortions.

Full Paper

Bibliographic reference.  Hermansky, Hynek / Fousek, Petr (2005): "Multi-resolution RASTA filtering for TANDEM-based ASR", In INTERSPEECH-2005, 361-364.