10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Optimization of Dereverberation Parameters Based on Likelihood of Speech Recognizer

Randy Gomez, Tatsuya Kawahara

Kyoto University, Japan

Speech recognition under reverberant condition is a difficult task. Most dereverberation techniques used to address this problem enhance the reverberant waveform independent from that of the speech recognizer. In this paper, we improve the conventional Spectral Subtraction-based (SS) dereverberation technique. In our proposed approach, the dereverberation parameters are optimized to improve the likelihood of the acoustic model. The system is capable of adaptively fine-tuning these parameters jointly with acoustic model training. Additional optimization is also implemented during decoding of the test utterances. We have evaluated using real reverberant data and experimental results show that the proposed method significantly improves the recognition performance over the conventional approach.

Full Paper

Bibliographic reference.  Gomez, Randy / Kawahara, Tatsuya (2009): "Optimization of dereverberation parameters based on likelihood of speech recognizer", In INTERSPEECH-2009, 1223-1226.