On Robustness of Unsupervised Domain Adaptation for Speaker Recognition

Pierre-Michel Bousquet, Mickael Rouvier

Current speaker recognition systems, which are trained on large datasets and rely on sophisticated modeling, turn out to be highly specific to their training data and sometimes give disappointing results in real-life applications. Any shift between training and test data, in terms of device, language, duration, noise or other factors, tends to degrade the accuracy of speaker detection. This study investigates unsupervised domain adaptation when only a scarce and unlabeled “in-domain” development dataset is available. The details and relevance of different approaches are described and discussed, leading to a new robust method that we call feature-Distribution Adaptor. The effectiveness of the proposed technique is experimentally validated on the recent NIST 2016 and 2018 Speaker Recognition Evaluation datasets.

DOI: 10.21437/Interspeech.2019-1524

Cite as: Bousquet, P., Rouvier, M. (2019) On Robustness of Unsupervised Domain Adaptation for Speaker Recognition. Proc. Interspeech 2019, 2958-2962, DOI: 10.21437/Interspeech.2019-1524.

  author={Pierre-Michel Bousquet and Mickael Rouvier},
  title={{On Robustness of Unsupervised Domain Adaptation for Speaker Recognition}},
  booktitle={Proc. Interspeech 2019},