INTERSPEECH 2004 - ICSLP
8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Optimizing Regression for In-Car Speech Recognition using Multiple Distributed Microphones

Weifeng Li, Fumitada Itakura, Kazuya Takeda

Nagoya University, Japan

In this paper, we address issues in improving hands-free speech recognition performance in different car environments using multiple spatially distributed microphones. In previous work, we proposed multiple regression of the log-spectra (MRLS) for estimating the log-spectra of speech at a close-talking microphone. In this paper, the idea is extended to nonlinear regressions. Isolated word recognition experiments under real car environments show that, compared to the nearest distant microphone, recognition accuracies could be improved by about 40% for very noisy driving conditions by using the optimizing regression method, The proposed approach outperforms linear regression methods and also outperforms adaptive beamformer by 8% and 3% respectively in terms of averaged recognition accuracies.

Full Paper

Bibliographic reference.  Li, Weifeng / Itakura, Fumitada / Takeda, Kazuya (2004): "Optimizing regression for in-car speech recognition using multiple distributed microphones", In INTERSPEECH-2004, 2689-2692.