We give a unification of several different speaker recognition problems in terms of the general speaker partitioning problem, where a set of N inputs has to be partitioned into subsets according to speaker. We show how to solve this problem in terms of a simple generative model and demonstrate performance on NIST SRE 2006 and 2008 data. Our solution yields probabilistic outputs, which we show how to evaluate with a cross-entropy criterion. Finally, we show improved accuracy of the generative model via a discriminatively trained re-calibration transformation of log-likelihoods.
Cite as: Brümmer, N., de Villiers, E. (2010) The speaker partitioning problem. Proc. The Speaker and Language Recognition Workshop (Odyssey 2010), paper 34
@inproceedings{brummer10_odyssey, author={Niko Brümmer and Edward {de Villiers}}, title={{The speaker partitioning problem}}, year=2010, booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2010)}, pages={paper 34} }