Odyssey 2008: The Speaker and Language Recognition Workshop

Stellenbosch, South Africa
January 21-24, 2008

Improving the Performance of Text-Independent Short Duration SVM- and GMM-based Speaker Verification

Benoit Fauve (1), Nicholas Evans (2,1), John Mason (1)

(1) Speech and Image Research Group, University of Wales Swansea, UK
(2) LIA, Universit´e d’Avignon et des Pays de Vaucluse, France

In the task of automatic speaker verification (ASV) it is well known that the duration of the speech signals is an important factor in the ultimate accuracy of the system. This paper deals with some of the aspects of adapting systems to work with limited amounts of data. First we highlight the importance of a well-tuned speech detection front-end when working with short durations. We consider a well-established technique (GMM) as well as a recent development (SVM on GMM mean supervectors), showing their limitations and alternatives. In particular the benefit of eigenvoice modelling in the context of short duration tasks is highlighted. Finally experiments on standard NIST databases demonstrate fusion potential between the presented techniques and significant gains when compared to a single GMM.

Full Paper     Presentation (PDF)

Bibliographic reference.  Fauve, Benoit / Evans, Nicholas / Mason, John (2008): "Improving the performance of text-independent short duration SVM- and GMM-based speaker verification", In Odyssey-2008, paper 018.