ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Application of noise robust MDT speech recognition on the SPEECON and speechdat-car databases

J. F. Gemmeke, Y. Wang, Maarten Van Segbroeck, B. Cranen, Hugo Van hamme

We show that the recognition accuracy of an MDT recognizer which performs well on artificially noisified data, deteriorates rapidly under realistic noisy conditions (using multiple microphone recordings from the SPEECON/SpeechDat-Car databases) and is outperformed by a commercially available recognizer which was trained using a multi-condition paradigm. Analysis of the recognition results indicates that the recording channels with the lowest SNRs where the MDT recognizer fails most, are also the channels which suffer most from room reverberation. Despite the channel compensation measures we took, it appears difficult to maintain the restorative power of MDT in such non-additive noise conditions.


doi: 10.21437/Interspeech.2009-354

Cite as: Gemmeke, J.F., Wang, Y., Segbroeck, M.V., Cranen, B., Van hamme, H. (2009) Application of noise robust MDT speech recognition on the SPEECON and speechdat-car databases. Proc. Interspeech 2009, 1227-1230, doi: 10.21437/Interspeech.2009-354

@inproceedings{gemmeke09_interspeech,
  author={J. F. Gemmeke and Y. Wang and Maarten Van Segbroeck and B. Cranen and Hugo {Van hamme}},
  title={{Application of noise robust MDT speech recognition on the SPEECON and speechdat-car databases}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1227--1230},
  doi={10.21437/Interspeech.2009-354}
}