Odyssey 2010: The Speaker and Language Recognition Workshop
Brno, Czech Republic
This paper investigates the effects of limited speech data in the context of speaker verification using the Gaussian mixture model (GMM) mean supervector support vector machine (SVM) classifier. This classifier provides state-of-the-art performance when sufficient speech is available, however, its robustness to the effects of limited speech resources has not yet been ascertained. Verification performance is analysed with regards to the duration of impostor utterances used for background, score normalisation and session compensation training cohorts. Results highlight the importance of matching the speech duration of utterances in these cohorts to the expected evaluation conditions. Performance was shown to be particularly sensitive to the utterance duration of examples in the background dataset. It was also found that the nuisance attribute projection (NAP) approach to session compensation often degrades performance when both training and testing data are limited. An analysis of the session and speaker variability in the mean supervector space provides some insight into the cause of this phenomenon.
Full Paper (PDF)
Bibliographic reference. McLaren, Mitchell / Vogt, Robbie / Baker, Brendan / Sridharan, Sridha (2010): "Experiments in SVM-based Speaker Verification Using Short Utterances", In Odyssey-2010, paper 017.