5th International Conference on Spoken Language Processing
This paper studies the reliance of a Gaussian Mixture Model (GMM) based closed-set Speaker Identification system on model convergence and describes methods to improve this convergence. It shows that the reason why the Vector Quantisation GMMs (VQGMMs) outperform a simple GMM is mainly due to decreasing the complexity of the data during training. In addition, it is shown that the VQGMM system is less computationally complex than the traditional GMM, yielding a system which is quicker to train and which gives higher performance. We also investigate four different VQ distance measures which can be used in the training of a VQGMM and compare their respective performances. It is found that the improvements gained by the VQGMM is only marginally dependant on the distance measure.
Bibliographic reference. Moody, James / Slomka, Stefan / Pelecanos, Jason / Sridharan, Sridha (1998): "On the convergence of Gaussian mixture models: improvements through vector quantization", In ICSLP-1998, paper 0667.