Priors for Speaker Counting and Diarization with AHC

Gregory Sell, Alan McCree, Daniel Garcia-Romero


Estimating the number of speakers in an audio segment is a necessary step in the process of speaker diarization, but current diarization algorithms do not explicitly define a prior probability on this estimation. This work proposes a process for including priors in speaker diarization with agglomerative hierarchical clustering (AHC). It is also shown that the exclusion of a prior with AHC is itself implicitly a prior, which is found to be geometric growth in the number of speakers. By using more sensible priors, we are able to demonstrate significantly improved robustness to calibration error for speaker counting and speaker diarization.


DOI: 10.21437/Interspeech.2016-1380

Cite as

Sell, G., McCree, A., Garcia-Romero, D. (2016) Priors for Speaker Counting and Diarization with AHC. Proc. Interspeech 2016, 2194-2198.

Bibtex
@inproceedings{Sell+2016,
author={Gregory Sell and Alan McCree and Daniel Garcia-Romero},
title={Priors for Speaker Counting and Diarization with AHC},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-1380},
url={http://dx.doi.org/10.21437/Interspeech.2016-1380},
pages={2194--2198}
}