ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification

Najim Dehak, Réda Dehak, Patrick Kenny, Niko Brümmer, Pierre Ouellet, Pierre Dumouchel

This paper presents a new speaker verification system architecture based on Joint Factor Analysis (JFA) as feature extractor. In this modeling, the JFA is used to define a new low-dimensional space named the total variability factor space, instead of both channel and speaker variability spaces for the classical JFA. The main contribution in this approach, is the use of the cosine kernel in the new total factor space to design two different systems: the first system is Support Vector Machines based, and the second one uses directly this kernel as a decision score. This last scoring method makes the process faster and less computation complex compared to others classical methods. We tested several intersession compensation methods in total factors, and we found that the combination of Linear Discriminate Analysis and Within Class Covariance Normalization achieved the best performance. We achieved a remarkable results using fast scoring method based only on cosine kernel especially for male trials, we yield an EER of 1.12% and MinDCF of 0.0094 on the English trials of the NIST 2008 SRE dataset.


doi: 10.21437/Interspeech.2009-385

Cite as: Dehak, N., Dehak, R., Kenny, P., Brümmer, N., Ouellet, P., Dumouchel, P. (2009) Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification. Proc. Interspeech 2009, 1559-1562, doi: 10.21437/Interspeech.2009-385

@inproceedings{dehak09_interspeech,
  author={Najim Dehak and Réda Dehak and Patrick Kenny and Niko Brümmer and Pierre Ouellet and Pierre Dumouchel},
  title={{Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1559--1562},
  doi={10.21437/Interspeech.2009-385}
}