12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Real-Time Prototype for Integration of Blind Source Extraction and Robust Automatic Speech Recognition

Francesco Nesta, Marco Matassoni, HariKrishna Maganti

FBK-irst, Italy

This demo presents a real-time prototype for automatic blind source extraction and speech recognition in presence of multiple interfering noise sources. Binaural recorded mixtures are processed by a combined Blind/Semi-Blind Source Separation algorithm in order to obtain an estimation of the target signal. The recovered target signal is segmented and used as input to a real-time automatic speech recognition (ASR) system. Further, to improve the recognition performance, noise robust features based on Gammatone filters are used. The demo utilizes the data provided for the CHiME Pascal speech separation and recognition challenge and also real-time mixtures recorded on-site. Users will be able to listen to the recovered target signal and compare it with the original mixture and ASR output.

Full Paper

Bibliographic reference.  Nesta, Francesco / Matassoni, Marco / Maganti, HariKrishna (2011): "Real-time prototype for integration of blind source extraction and robust automatic speech recognition", In INTERSPEECH-2011, 3343-3344.