Odyssey 2008: The Speaker and Language Recognition Workshop

Stellenbosch, South Africa
January 21-24, 2008

Factor Analysis Modelling for Speaker Verification with Short Utterances

Robbie Vogt, Chris Lustri, Sridha Sridharan

Speech Research Laboratory, Queensland University of Technology, Brisbane, Australia

This paper examines combining both relevance MAP and subspace speaker adaptation processes to train GMM speaker models for use in speaker verification systems with a particular focus on short utterance lengths. The subspace speaker adaptation method involves developing a speaker GMM mean supervector as the sum of a speaker-independent prior distribution and a speaker dependent offset constrained to lie within a low-rank subspace, and has been shown to provide improvements in accuracy over ordinary relevance MAP when the amount of training data is limited. It is shown through testing on NIST SRE data that combining the two processes provides speaker models which lead to modest improvements in verification accuracy for limited data situations, in addition to improving the performance of the speaker verification system when a larger amount of available training data is available.

Full Paper

Bibliographic reference.  Vogt, Robbie / Lustri, Chris / Sridharan, Sridha (2008): "Factor analysis modelling for speaker verification with short utterances", In Odyssey-2008, paper 019.