10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Application of Noise Robust MDT Speech Recognition on the SPEECON and SpeechDat-Car Databases

J. F. Gemmeke (1), Y. Wang (2), Maarten Van Segbroeck (2), B. Cranen (1), Hugo Van hamme (2)

(1) Radboud Universiteit Nijmegen, The Netherlands
(2) Katholieke Universiteit Leuven, Belgium

We show that the recognition accuracy of an MDT recognizer which performs well on artificially noisified data, deteriorates rapidly under realistic noisy conditions (using multiple microphone recordings from the SPEECON/SpeechDat-Car databases) and is outperformed by a commercially available recognizer which was trained using a multi-condition paradigm. Analysis of the recognition results indicates that the recording channels with the lowest SNRs where the MDT recognizer fails most, are also the channels which suffer most from room reverberation. Despite the channel compensation measures we took, it appears difficult to maintain the restorative power of MDT in such non-additive noise conditions.

Full Paper

Bibliographic reference.  Gemmeke, J. F. / Wang, Y. / Segbroeck, Maarten Van / Cranen, B. / Van hamme, Hugo (2009): "Application of noise robust MDT speech recognition on the SPEECON and speechdat-car databases", In INTERSPEECH-2009, 1227-1230.