12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

A Level-Dependent Auditory Filter-Bank for Speech Recognition in Reverberant Environments

HariKrishna Maganti, Marco Matassoni

FBK-irst, Italy

Distortions due to reverberation have detrimental effect on the performance of automatic speech recognition (ASR). In this work, an auditory filter-bank based feature is presented to improve the ASR in reverberant conditions. The proposed technique is based on the gammachirp filter bank which provides level dependent frequency response to emulate mechanisms performed in the human auditory system, particularly basilar membrane filtering aimed to improve robustness of the ear. The low frequency tail of gammachirp filter which is unaffected by bandwidth parameters due to level dependency frequency resolution is effective in reducing the reverberation distortions. Experiments are performed on the Aurora-5 meeting recorder digit task recorded with four different microphones in hands-free mode at a real meeting room. The ASR experiments using the proposed gammachirp based features show reliable and consistent improvements when compared to other conventional feature extraction techniques.

Full Paper

Bibliographic reference.  Maganti, HariKrishna / Matassoni, Marco (2011): "A level-dependent auditory filter-bank for speech recognition in reverberant environments", In INTERSPEECH-2011, 685-688.