ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

A study of bootstrapping with multiple acoustic features for improved automatic speech recognition

Xiaodong Cui, Jian Xue, Bing Xiang, Bowen Zhou

This paper investigates a scheme of bootstrapping with multiple acoustic features (MFCC, PLP and LPCC) to improve the overall performance of automatic speech recognition. In this scheme, a Gaussian mixture distribution is estimated for each type of feature resampled in each HMM state by single-pass retraining on a shared decision tree. Thus obtained acoustic models based on the multiple features are combined by likelihood averaging during decoding. Experiments on large vocabulary spontaneous speech recognition show its superior overall performance than the best of acoustic models from individual features. It also achieves comparable performance to Recognizer Output Voting Error Reduction (ROVER) with computational advantages.


doi: 10.21437/Interspeech.2009-85

Cite as: Cui, X., Xue, J., Xiang, B., Zhou, B. (2009) A study of bootstrapping with multiple acoustic features for improved automatic speech recognition. Proc. Interspeech 2009, 240-243, doi: 10.21437/Interspeech.2009-85

@inproceedings{cui09_interspeech,
  author={Xiaodong Cui and Jian Xue and Bing Xiang and Bowen Zhou},
  title={{A study of bootstrapping with multiple acoustic features for improved automatic speech recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={240--243},
  doi={10.21437/Interspeech.2009-85}
}