Symposium on Machine Learning in Speech and Language Processing (MLSLP)

Portland, Oregon, USA
September 14, 2012

Boosting Localized Binary Features for Speech Recognition

Anindya Roy (1), Mathew Magimai-Doss (2), Sébastien Marcel (2)

(1) Spoken Language Processing group, LIMSI-CNRS, Orsay, France
(2) Idiap Research Institute, Martigny, Switzerland

In a recent work, the framework of Boosted Binary Features (BBF) was proposed for ASR. In this framework, a small set of localized binary-valued features are selected using the Discrete Adaboost algorithm. These features are then integrated into a standard HMM-based system using either single layer perceptrons (SLP) or multilayer perceptrons (MLP). The features were found to perform significantly better (when coupled with SLP) and equally well (when coupled with MLP) compared to MFCC features on the TIMIT phoneme recognition task. The current work presents an overview of the idea and extends it in two directions: 1) fusion of BBF withMFCC and an analysis of their complementarity, 2) scalability of the proposed features from phoneme recognition to the continuous speech recognition task and reusability on unseen data.

Index Terms: Boosting, localized features, spectrotemporal features, speech recognition, feature fusion.

Full Paper    

Bibliographic reference.  Roy, Anindya / Magimai-Doss, Mathew / Marcel, Sébastien (2012): "Boosting localized binary features for speech recognition", In MLSLP-2012, 18-21.