ISCA Archive Odyssey 2012
ISCA Archive Odyssey 2012

Adaptation transforms of auto-associative neural networks as features for speaker verification

Samuel Thomas, Sri Harish Mallidi, Sriram Ganapathy, Hynek Hermansky

We present a new approach of using Auto-Associative Neural Networks (AANNs) in the conventional GMM speaker verification framework with i-vector feature extraction and PLDA modeling. In this technique, an i-vector feature extractor is trained using adaptation parameters from a mixture of AANNs. In order to model parts of each speaker's acoustic space, a training objective function based on posterior probabilities of broad phonetic classes is used. The AANN based i-vectors are fused with GMM based i-vectors and a joint PLDA model is trained. The proposed approach provides promising results and significant gains when combined with baseline systems on the telephone conditions of NIST SRE 2010 and the recently concluded IARPA BEST 2011 speaker evaluations.


Cite as: Thomas, S., Mallidi, S.H., Ganapathy, S., Hermansky, H. (2012) Adaptation transforms of auto-associative neural networks as features for speaker verification. Proc. The Speaker and Language Recognition Workshop (Odyssey 2012), 98-104

@inproceedings{thomas12_odyssey,
  author={Samuel Thomas and Sri Harish Mallidi and Sriram Ganapathy and Hynek Hermansky},
  title={{Adaptation transforms of auto-associative neural networks as features for speaker verification}},
  year=2012,
  booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2012)},
  pages={98--104}
}