Odyssey 2012 - The Speaker and Language Recognition Workshop
In this paper, we propose a new clustering model for speaker diarization. A major problem with using greedy agglomerative hierarchical clustering for speaker diarization is that they do not guarantee an optimal solution. We propose a new clustering model, by redefining clustering as a problem of Integer Linear Programming (ILP). Thus an ILP solver can be used which searches the solution of speaker clustering over the whole problem. The experiments were conducted on the corpus of French broadcast news ESTER-2. With this new clustering, the DER decreases by 2.43 points.
Bibliographic reference. Rouvier, Mickael / Meignier, Sylvain (2012): "A global optimization framework for speaker diarization", In Odyssey-2012, 146-150.