ISCA Archive Interspeech 2008

Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition

Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda

This paper proposes a technique for determining prior distributions using cross validation for speech recognition based on the Bayesian approach. The Bayesian method is a statistical technique for estimating reliable predictive distributions by marginalizing model parameters, and its approximate version, the variational Bayesian method, has been applied to HMM-based speech recognition. Since prior distributions representing prior information about model parameters affect the posterior distributions and model selection, the determination of prior distributions is an important problem. However, it has not been thoroughly investigated in speech recognition. The proposed method can determine reliable prior distributions without tuning parameters and select an appropriate model structure depending on the amount of training data. Continuous phoneme recognition experiments show that the proposed method achieved higher performance than the conventional methods.


doi: 10.21437/Interspeech.2008-112

Cite as: Hashimoto, K., Zen, H., Nankaku, Y., Lee, A., Tokuda, K. (2008) Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition. Proc. Interspeech 2008, 936-939, doi: 10.21437/Interspeech.2008-112

@inproceedings{hashimoto08_interspeech,
  author={Kei Hashimoto and Heiga Zen and Yoshihiko Nankaku and Akinobu Lee and Keiichi Tokuda},
  title={{Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={936--939},
  doi={10.21437/Interspeech.2008-112}
}