ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Speaker recognition based on variational Bayesian method

Tatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda

This paper presents a speaker identification system based on Gaussian Mixture Models (GMM) using the variational Bayesian method. Maximum Likelihood (ML) and Maximum A Posterior (MAP) are well-known methods for estimating GMM parameters. However, the overtraining problem occurs with insufficient data due to a point estimate of model parameters. The Bayesian approach estimates a posterior distribution of model parameters and achieves a more robust prediction than ML and MAP approach. To solve complicated integral calculations in the Bayesian approach, the variational Bayesian method has been proposed and applied to many classification problems using latent variable models. However, the performance of the Bayesian approach has not been extensively investigated in large speaker identification tasks. The experimental results shows that the VB method improves the overtraining problem than the conventional ML and MAP methods.


doi: 10.21437/Interspeech.2008-410

Cite as: Ito, T., Hashimoto, K., Nankaku, Y., Lee, A., Tokuda, K. (2008) Speaker recognition based on variational Bayesian method. Proc. Interspeech 2008, 1417-1420, doi: 10.21437/Interspeech.2008-410

@inproceedings{ito08b_interspeech,
  author={Tatsuya Ito and Kei Hashimoto and Yoshihiko Nankaku and Akinobu Lee and Keiichi Tokuda},
  title={{Speaker recognition based on variational Bayesian method}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1417--1420},
  doi={10.21437/Interspeech.2008-410}
}