We performed automated feature selection for multi-stream (i.e., ensemble) automatic speech recognition, using a hill-climbing (HC) algorithm that changes one feature at a time if the change improves a performance score. For both clean and noisy data sets (using the OGI Numbers corpus), HC usually improved performance on held out data compared to the initial system it started with, even for noise types that were not seen during the HC process. Overall, we found that using Opitzís scoring formula, which blends single-classifier word recognition accuracy and ensemble diversity, worked better than ensemble accuracy as a performance score for guiding HC in cases of extreme mismatch between the SNR of training and test sets.
Our noisy version of the Numbers corpus, our multi-layerperceptron- based Numbers ASR system, and our HC scripts are available online.
Bibliographic reference. Gelbart, David / Morgan, Nelson / Tsymbal, Alexey (2009): "Hill-climbing feature selection for multi-stream ASR", In INTERSPEECH-2009, 2967-2970.