Odyssey 2010: The Speaker and Language Recognition Workshop

Brno, Czech Republic
28 June - 1 July 2010

On the Use of Factor Analysis with Restricted Target Data in Speaker Verification

Javier Gonzalez-Dominguez (1), Brendan Baker, Robbie Vogt (2), Joaquin Gonzalez-Rodriguez (1), Sridha Sridharan (2)

(1) ATVS-UAM, (2) QUT

Factor Analysis (FA) based techniques have become the state of the art in automatic speaker verification thanks to their great ability to model session variability. This ability, in turn, relies on accurately estimating a session variability subspace for the operating conditions of interest. In cases such as forensic speaker recognition, however, this requirement cannot always be satisfied due to the very limited quantity of appropriate development data. As a first step toward understanding the application of FA in these restricted data scenarios, this work analyzes the performance of FA with very limited development data and then explores several FA estimation methods that augment the target domain data with examples from a data-rich domain. Experiments on NIST SRE 2006 microphone data conditions demonstrate that telephone data can be effectively exploited to improve performance over a baseline system.

Bibliographic reference. Gonzalez-Dominguez, Javier / Baker, Brendan / Vogt, Robbie / Gonzalez-Rodriguez, Joaquin / Sridharan, Sridha (2010): "On the Use of Factor Analysis with Restricted Target Data in Speaker Verification", in Odyssey-2010, paper 020.