11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

On the Potential of Channel Selection for Recognition of Reverberated Speech with Multiple Microphones

Martin Wolf, Climent Nadeu

Universitat Politècnica de Catalunya, Spain

The performance of ASR systems in a room environment with distant microphones is strongly affected by reverberation. Because the degree of signal distortion varies among acoustic channels (i.e., microphones), recognition accuracy can benefit from proper channel selection. In this paper, we show experimentally that there is a large margin for WER reduction through channel selection, and we discuss several possible methods that do not require any a priori classification. Moreover, on an LVCSR task, a significant WER reduction is obtained with a simple technique that uses a measure computed from the sub-band time envelopes of the microphone signals.
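
The abstract does not spell out the envelope-based measure, so the following is only an illustrative sketch of how signal-based channel selection of this kind might look, not the authors' implementation. It assumes that reverberation smooths the sub-band time envelope, so the channel whose compressed envelopes show the largest variance is taken as the least distorted. The function names, the uniform FFT bands (a stand-in for a perceptual filter bank), the cube-root compression, and all parameter values are assumptions made for illustration.

```python
import numpy as np

def subband_envelopes(signal, frame_len=256, hop=128, n_bands=8):
    """Per-frame sub-band energies, shape (frames, bands).

    Assumption: uniform FFT bands stand in for a perceptual filter bank.
    """
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    bands = np.array_split(power, n_bands, axis=1)
    # Small floor avoids log(0) in silent frames.
    return np.stack([b.sum(axis=1) for b in bands], axis=1) + 1e-12

def envelope_variance(signal, **kwargs):
    """Variance over time of the compressed envelope, one value per band."""
    log_env = np.log(subband_envelopes(signal, **kwargs))
    log_env -= log_env.mean(axis=0)      # remove per-band gain differences
    compressed = np.exp(log_env / 3.0)   # cube-root compression (assumed)
    return compressed.var(axis=0)

def select_channel(signals, **kwargs):
    """Index of the channel with the largest summed, band-normalized variance."""
    scores = np.stack([envelope_variance(s, **kwargs) for s in signals])
    scores /= scores.max(axis=0)         # normalize each band across channels
    return int(np.argmax(scores.sum(axis=1)))
```

Given a list of equal-length recordings of the same utterance, one per microphone, `select_channel(recordings)` returns the index of the channel to pass to the recognizer. No recognition pass or a priori classification is needed, because the score depends only on the signals themselves.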

Bibliographic reference. Wolf, Martin / Nadeu, Climent (2010): "On the potential of channel selection for recognition of reverberated speech with multiple microphones", in INTERSPEECH-2010, 574-577.