EUROSPEECH '97

Gaussian mixture models (GMM's) have been demonstrated as one of the powerful statistical methods for speaker identification. In GMM method, the covariance matrix is usually assumed to be diagonal. That means the feature components are relatively uncorrelated. This assumption may not be correct. This paper concentrates on finding an orthogonal speakerdependent transformation to reduce the correlation between feature components. This transformation is based on the eigenvectors of the withinclass scatter matrix which is attained in each stage of iterative training of GMM parameters. Hence the transformation matrix and GMM parameters are both updated in each iteration until the total loglikelihood converges. An experimental evaluation of the proposed method is conducted on a 100person connected digit database for text independent speaker identification. The experimental result shows a reduction in the error rate by 42% when 7digit utterances are used for testing.
Bibliographic reference. Yuo, KuoHwei / Wang, HsiaoChuan (1997): "Gaussian mixture models with common principal axes and their application in textindependent speaker identification", In EUROSPEECH1997, 22792282.