10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Variational Dynamic Kernels for Speaker Verification

C. Longworth, R. C. van Dalen, M. J. F. Gales

University of Cambridge, UK

An important aspect of SVM-based speaker verification is the choice of dynamic kernel. Recently there has been interest in the use of kernels based on the Kullback-Leibler divergence between GMMs. Since this has no closed-form solution, typically a matched-pair upper bound is used instead. This places significant restrictions on the forms of model structure that may be used. All GMMs must contain the same number of components and must be adapted from a single background model. For many tasks this will not be optimal. In this paper, dynamic kernels are proposed based on alternative, variational approximations to the KL divergence. Unlike the matched-pair bound, these do not restrict the forms of GMM that may be used. Additionally, using a more accurate approximation of the divergence may lead to performance gains. Preliminary results using these kernels are presented on the NIST 2002 SRE dataset.

Full Paper

Bibliographic reference.  Longworth, C. / Dalen, R. C. van / Gales, M. J. F. (2009): "Variational dynamic kernels for speaker verification", In INTERSPEECH-2009, 1571-1574.