ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

A robust training strategy against extraneous acoustic variations for spontaneous speech recognition

Hui Jiang, Li Deng

In the paper, we propose a robust training strategy to deal with extraneous acoustic variations for conversational speech recognition. This strategy generalizes speaker adaptive training, where HMM parameter transformations are used to normalize the extraneous variations in the training data according to a set of pre-defined conditions. Then a compact model and the associated prior p.d.f.’s of transformation parameters are estimated using the maximum likelihood criterion. In the testing phase, the compact model and the prior p.d.f.’s are used to search for the unknown word sequence based on Bayesian Prediction Classification. The proposed strategy is evaluated in a Switchboard task to deal with pronunciation variations in spontaneous speech recognition. Preliminary results show moderate word error rate reduction over a well-trained baseline system under identical experimental conditions.


Cite as: Jiang, H., Deng, L. (2000) A robust training strategy against extraneous acoustic variations for spontaneous speech recognition. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 161-164

@inproceedings{jiang00c_icslp,
  author={Hui Jiang and Li Deng},
  title={{A robust training strategy against extraneous acoustic variations for spontaneous speech recognition}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 4, 161-164}
}