ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Large margin estimation of Gaussian mixture model parameters with extended baum-welch for spoken language recognition

Donglai Zhu, Bin Ma, Haizhou Li

Discriminative training (DT) methods of acoustic models, such as SVM and MMI-training GMM, have been proved effective in spoken language recognition. In this paper we propose a DT method for GMM using the large margin (LM) estimation. Unlike traditional MMI or MCE methods, the LM estimation attempts to enhance the generalization ability of GMM to deal with new data that exhibits mismatch with training data. We define the multi-class separation margin as a function of GMM likelihoods, and derive update formulae of GMM parameters with the extended Baum-Welch algorithm. Results on the NIST language recognition evaluation (LRE) 2007 task show that the LM estimation achieves better performance and faster convergent speed than the MMI estimation.


doi: 10.21437/Interspeech.2009-621

Cite as: Zhu, D., Ma, B., Li, H. (2009) Large margin estimation of Gaussian mixture model parameters with extended baum-welch for spoken language recognition. Proc. Interspeech 2009, 2179-2182, doi: 10.21437/Interspeech.2009-621

@inproceedings{zhu09_interspeech,
  author={Donglai Zhu and Bin Ma and Haizhou Li},
  title={{Large margin estimation of Gaussian mixture model parameters with extended baum-welch for spoken language recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2179--2182},
  doi={10.21437/Interspeech.2009-621}
}