ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Aggregated cross-validation and its efficient application to Gaussian mixture optimization

Takahiro Shinozaki, Sadaoki Furui, Tatsuya Kawahara

We have previously proposed a cross-validation (CV) based Gaussian mixture optimization method that efficiently optimizes the model structure based on CV likelihood. In this study, we propose aggregated cross-validation (AgCV) that introduces a bagging-like approach in the CV framework to reinforce the model selection ability. While a single model is used in CV to evaluate a held-out subset, AgCV uses multiple models to reduce the variance in the score estimation. By integrating AgCV instead of CV in the Gaussian mixture optimization algorithm, an AgCV likelihood based Gaussian mixture optimization algorithm is obtained. The algorithm works efficiently by using sufficient statistics and can be applied to large models such as Gaussian mixture HMM. The proposed algorithm is evaluated by speech recognition experiments on oral presentations and it is shown that lower word error rates are obtained by the AgCV optimization method when compared to CV and MDL based methods.


doi: 10.21437/Interspeech.2008-124

Cite as: Shinozaki, T., Furui, S., Kawahara, T. (2008) Aggregated cross-validation and its efficient application to Gaussian mixture optimization. Proc. Interspeech 2008, 2382-2385, doi: 10.21437/Interspeech.2008-124

@inproceedings{shinozaki08_interspeech,
  author={Takahiro Shinozaki and Sadaoki Furui and Tatsuya Kawahara},
  title={{Aggregated cross-validation and its efficient application to Gaussian mixture optimization}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={2382--2385},
  doi={10.21437/Interspeech.2008-124}
}