ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

On von-mises fisher mixture model in text-independent speaker identification

Jalil Taghia, Zhanyu Ma, Arne Leijon

This paper addresses text-independent speaker identification (SI) based on line spectral frequencies (LSFs). The LSFs are transformed to differential LSFs (ĢLSF) in order to exploit their boundary and ordering properties. We show that the square root of ĢLSF has interesting directional characteristics implying that their distribution can be modeled by a mixture of von-Mises Fisher (vMF) distributions. We analytically estimate the mixture model parameters in a fully Bayesian treatment by using variational inference. In the Bayesian inference, we can potentially determine the model complexity and avoid overfitting problem associated with conventional approaches based on the expectation maximization. The experimental results confirm the effectiveness of the proposed SI system.


doi: 10.21437/Interspeech.2013-418

Cite as: Taghia, J., Ma, Z., Leijon, A. (2013) On von-mises fisher mixture model in text-independent speaker identification. Proc. Interspeech 2013, 2499-2503, doi: 10.21437/Interspeech.2013-418

@inproceedings{taghia13_interspeech,
  author={Jalil Taghia and Zhanyu Ma and Arne Leijon},
  title={{On von-mises fisher mixture model in text-independent speaker identification}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2499--2503},
  doi={10.21437/Interspeech.2013-418}
}