ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Deterministic annealing based training algorithm for Bayesian speech recognition

Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda

This paper proposes a deterministic annealing based training algorithm for Bayesian speech recognition. The Bayesian method is a statistical technique for estimating reliable predictive distributions by marginalizing model parameters. However, the local maxima problem in the Bayesian method is more serious than in the ML-based approach, because the Bayesian method treats not only state sequences but also model parameters as latent variables. The deterministic annealing EM (DAEM) algorithm has been proposed to improve the local maxima problem in the EM algorithm, and its effectiveness has been reported in HMM-based speech recognition using ML criterion. In this paper, the DAEM algorithm is applied to Bayesian speech recognition to relax the local maxima problem. Speech recognition experiments show that the proposed method achieved a higher performance than the conventional methods.


doi: 10.21437/Interspeech.2009-236

Cite as: Shiota, S., Hashimoto, K., Nankaku, Y., Tokuda, K. (2009) Deterministic annealing based training algorithm for Bayesian speech recognition. Proc. Interspeech 2009, 680-683, doi: 10.21437/Interspeech.2009-236

@inproceedings{shiota09_interspeech,
  author={Sayaka Shiota and Kei Hashimoto and Yoshihiko Nankaku and Keiichi Tokuda},
  title={{Deterministic annealing based training algorithm for Bayesian speech recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={680--683},
  doi={10.21437/Interspeech.2009-236}
}