Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR

Guillaume Lathoud, Mathew Magimai-Doss, Bertrand Mesot

IDIAP Research Institute, Switzerland

This paper proposes a simple, computationally efficient 2-mixture model approach to discrimination between speech and background noise. It is directly derived from observations on real data, and can be used in a fully unsupervised manner, with the EM algorithm. A first application to sector-based, joint audio source localization and detection, using multiple microphones, confirms that the model can provide major enhancement. A second application to the single channel speech recognition task in a noisy environment yields major improvement on stationary noise and promising results on non-stationary noise.

Full Paper

Bibliographic reference.  Lathoud, Guillaume / Magimai-Doss, Mathew / Mesot, Bertrand (2005): "A spectrogram model for enhanced source localization and noise-robust ASR", In INTERSPEECH-2005, 2345-2348.