10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

GMM Kernel by Taylor Series for Speaker Verification

Minqiang Xu (1), Xi Zhou (2), Beiqian Dai (1), Thomas S. Huang (2)

(1) University of Science & Technology of China, China
(2) University of Illinois at Urbana-Champaign, USA

Currently, approach of Gaussian Mixture Model combined with Support Vector Machine to text-independent speaker verification task has produced the stat-of-the-art performance. Many kernels have been reported for combining GMM and SVM.

In this paper, we propose a novel kernel to represent the GMM distribution by Taylor expansion theorem and itís regarded as the input of SVM. The utterance-specific GMM is represented as a combination of orders of Taylor series expansing at the means of the Gaussian components. Here we extract the distribution information around the means of the Gaussian components in the GMM as we can naturally assume that each mean position indicates a feature cluster in the feature space. And then the kernel computes the emsemble distance between orders of Taylor series.

Results of our new kernel on NIST speaker recognition evaluation (SRE) 2006 core task have been shown relative improvements of up to 7.1% and 11.7% in EER for male and female compared to K-L divergence based SVM system.

Full Paper

Bibliographic reference.  Xu, Minqiang / Zhou, Xi / Dai, Beiqian / Huang, Thomas S. (2009): "GMM kernel by Taylor series for speaker verification", In INTERSPEECH-2009, 1283-1286.