Iterative PLDA Adaptation for Speaker Diarization

Gaël Le Lan, Delphine Charlet, Anthony Larcher, Sylvain Meignier


This paper investigates iterative PLDA adaptation for cross-show speaker diarization applied to small collections of French TV archives based on an i-vector framework. Using the target collection itself for unsupervised adaptation, PLDA parameters are iteratively tuned while score normalization is applied for convergence. Performances are compared, using combinations of target and external data for training and adaptation. The experiments on two distinct target corpora show that the proposed framework can gradually improve an existing system trained on external annotated data. Such results indicate that performing speaker diarization on small collections of unlabeled audio archives should only rely on the availability of a sufficient bootstrap system, which can be incrementally adapted to every target collection. The proposed framework also widens the range of acceptable speaker clustering thresholds for a given performance objective.


DOI: 10.21437/Interspeech.2016-572

Cite as

Lan, G.L., Charlet, D., Larcher, A., Meignier, S. (2016) Iterative PLDA Adaptation for Speaker Diarization. Proc. Interspeech 2016, 2175-2179.

Bibtex
@inproceedings{Lan+2016,
author={Gaël Le Lan and Delphine Charlet and Anthony Larcher and Sylvain Meignier},
title={Iterative PLDA Adaptation for Speaker Diarization},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-572},
url={http://dx.doi.org/10.21437/Interspeech.2016-572},
pages={2175--2179}
}