Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering

Ignacio Viñals, Alfonso Ortega, Jesús Villalba, Antonio Miguel, Eduardo Lleida


This work presents a new strategy to perform diarization dealing with high variability data, such as multimedia information in broadcast. This variability is highly noticeable among domains (inter-domain variability among chapters, shows, genres, etc.). Therefore, each domain requires its own specific model to obtain the optimal results. We propose to adapt the PLDA models of our diarization system with in-domain unlabeled data. To do it, we estimate pseudo-speaker labels by unsupervised speaker clustering. This new method has been included in a PLDA-based diarization system and evaluated on the Multi-Genre Broadcast 2015 Challenge data. Given an audio, the system computes short-time i-vectors and clusters them using a variational Bayesian PLDA model with hidden labels. The proposed method improves 25.41% relative w.r.t. the system without PLDA adaptation.


 DOI: 10.21437/Interspeech.2017-84

Cite as: Viñals, I., Ortega, A., Villalba, J., Miguel, A., Lleida, E. (2017) Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering. Proc. Interspeech 2017, 2829-2833, DOI: 10.21437/Interspeech.2017-84.


@inproceedings{Viñals2017,
  author={Ignacio Viñals and Alfonso Ortega and Jesús Villalba and Antonio Miguel and Eduardo Lleida},
  title={Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={2829--2833},
  doi={10.21437/Interspeech.2017-84},
  url={http://dx.doi.org/10.21437/Interspeech.2017-84}
}