Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition

Alan McCree, Gregory Sell, Daniel Garcia-Romero


Probabilistic Linear Discriminant Analysis (PLDA) continues to be the most effective approach for speaker recognition in the i-vector space. This paper extends the PLDA model to include both enrollment and test cut duration as well as to distinguish between session and channel variability. In addition, we address the task of unsupervised adaptation to unknown new domains in two ways: speaker-dependent PLDA parameters and cohort score normalization using Bayes rule. Experimental results on the NIST SRE16 task show that these principled techniques provide state-of-the-art performance with negligible increase in complexity over a PLDA baseline.


 DOI: 10.21437/Interspeech.2017-1586

Cite as: McCree, A., Sell, G., Garcia-Romero, D. (2017) Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition. Proc. Interspeech 2017, 1552-1556, DOI: 10.21437/Interspeech.2017-1586.


@inproceedings{McCree2017,
  author={Alan McCree and Gregory Sell and Daniel Garcia-Romero},
  title={Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={1552--1556},
  doi={10.21437/Interspeech.2017-1586},
  url={http://dx.doi.org/10.21437/Interspeech.2017-1586}
}