INTERSPEECH 2009
10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Hill-Climbing Feature Selection for Multi-Stream ASR

David Gelbart (1), Nelson Morgan (1), Alexey Tsymbal (2)

(1) ICSI, USA
(2) Siemens AG, Germany

We performed automated feature selection for multi-stream (i.e., ensemble) automatic speech recognition, using a hill-climbing (HC) algorithm that changes one feature at a time if the change improves a performance score. For both clean and noisy data sets (using the OGI Numbers corpus), HC usually improved performance on held out data compared to the initial system it started with, even for noise types that were not seen during the HC process. Overall, we found that using Opitzís scoring formula, which blends single-classifier word recognition accuracy and ensemble diversity, worked better than ensemble accuracy as a performance score for guiding HC in cases of extreme mismatch between the SNR of training and test sets.

Our noisy version of the Numbers corpus, our multi-layerperceptron- based Numbers ASR system, and our HC scripts are available online.

Full Paper

Bibliographic reference.  Gelbart, David / Morgan, Nelson / Tsymbal, Alexey (2009): "Hill-climbing feature selection for multi-stream ASR", In INTERSPEECH-2009, 2967-2970.