Odyssey 2008: The Speaker and Language Recognition Workshop
Stellenbosch, South Africa
We compare speaker verification systems based on factor analysis modeling with systems based on support vector machines that use GMM supervectors as features. All systems used the same acoustic features and were trained and tested on the same data sets. We tested two kernels (one linear, one non-linear) for the GMM support vector machines. Factor analysis using speaker factors gave the best results on the core condition of the NIST 2006 speaker recognition evaluation, and the difference was particularly marked on the English language subset. Fusion of all systems gave an equal error rate of 4.2% (all trials) and 3.2% (English trials only).
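For readers unfamiliar with the metric quoted above, the equal error rate is the operating point at which the miss rate (target trials rejected) equals the false-alarm rate (non-target trials accepted). The following is a minimal sketch of one common way to estimate it from two lists of verification scores; the function name `equal_error_rate` and the toy scores are illustrative assumptions, not the authors' scoring tool or data.

```python
import numpy as np


def equal_error_rate(target_scores, nontarget_scores):
    """Estimate the EER (as a fraction) from verification scores.

    Hypothetical helper for illustration: a trial is accepted when its
    score is greater than or equal to the decision threshold.
    """
    target = np.asarray(target_scores, dtype=float)
    nontarget = np.asarray(nontarget_scores, dtype=float)

    # Use every observed score as a candidate threshold.
    thresholds = np.sort(np.concatenate([target, nontarget]))

    # Miss rate: fraction of target trials scoring below the threshold.
    miss = np.array([(target < t).mean() for t in thresholds])
    # False-alarm rate: fraction of non-target trials at or above it.
    false_alarm = np.array([(nontarget >= t).mean() for t in thresholds])

    # Take the threshold where the two rates are closest and average them.
    i = np.argmin(np.abs(miss - false_alarm))
    return (miss[i] + false_alarm[i]) / 2.0


# Toy scores, made up for this example only.
eer = equal_error_rate([2.0, 3.0, 4.0, 5.0], [0.0, 1.0, 2.5, 1.5])
print(f"EER = {100 * eer:.1f}%")
```

With a finite number of trials the two error rates rarely cross exactly, which is why the sketch averages them at the closest point; evaluation toolkits may interpolate instead.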
Bibliographic reference. Dehak, Najim / Dehak, Réda / Kenny, Patrick / Dumouchel, Pierre (2008): "Comparison between factor analysis and GMM support vector machines for speaker verification", In Odyssey-2008, paper 009.