Odyssey 2012 - The Speaker and Language Recognition Workshop
We present a new approach of using Auto-Associative Neural Networks (AANNs) in the conventional GMM speaker verification framework with i-vector feature extraction and PLDA modeling. In this technique, an i-vector feature extractor is trained using adaptation parameters from a mixture of AANNs. In order to model parts of each speaker's acoustic space, a training objective function based on posterior probabilities of broad phonetic classes is used. The AANN based i-vectors are fused with GMM based i-vectors and a joint PLDA model is trained. The proposed approach provides promising results and significant gains when combined with baseline systems on the telephone conditions of NIST SRE 2010 and the recently concluded IARPA BEST 2011 speaker evaluations.
Bibliographic reference. Thomas, Samuel / Mallidi, Sri Harish / Ganapathy, Sriram / Hermansky, Hynek (2012): "Adaptation transforms of auto-associative neural networks as features for speaker verification", In Odyssey-2012, 98-104.