ISCA Archive Interspeech 2015
ISCA Archive Interspeech 2015

Dataset-invariant covariance normalization for out-domain PLDA speaker verification

Md. Hafizur Rahman, Ahilan Kanagasundaram, David Dean, Sridha Sridharan

In this paper we introduce a novel domain-invariant covariance normalization (DICN) technique to relocate both in-domain and out-domain i-vectors into a third dataset-invariant space, providing an improvement for out-domain PLDA speaker verification with a very small number of unlabelled in-domain adaptation i-vectors. By capturing the dataset variance from a global mean using both development out-domain i-vectors and limited unlabelled in-domain i-vectors, we could obtain domain-invariant representations of PLDA training data. The DICN-compensated out-domain PLDA system is shown to perform as well as in-domain PLDA training with as few as 500 unlabelled in-domain i-vectors for NIST-2010 SRE and 2000 unlabelled in-domain i-vectors for NIST-2008 SRE, and considerable relative improvement over both out-domain and in-domain PLDA development if more are available.


doi: 10.21437/Interspeech.2015-276

Cite as: Rahman, M.H., Kanagasundaram, A., Dean, D., Sridharan, S. (2015) Dataset-invariant covariance normalization for out-domain PLDA speaker verification. Proc. Interspeech 2015, 1017-1021, doi: 10.21437/Interspeech.2015-276

@inproceedings{rahman15_interspeech,
  author={Md. Hafizur Rahman and Ahilan Kanagasundaram and David Dean and Sridha Sridharan},
  title={{Dataset-invariant covariance normalization for out-domain PLDA speaker verification}},
  year=2015,
  booktitle={Proc. Interspeech 2015},
  pages={1017--1021},
  doi={10.21437/Interspeech.2015-276}
}