Estimating Speaker Clustering Quality Using Logistic Regression

Yishai Cohen, Itshak Lapidot


This paper focuses on estimating clustering validity by using logistic regression. For many applications it might be important to estimate the quality of the clustering, e.g. in case of speech segments’ clustering, make a decision whether to use the clustered data for speaker verification. In the case of short segments speakers clustering, the common criteria for cluster validity are average cluster purity (ACP), average speaker purity (ASP) and K — the geometric mean between the two measures. As in practice, true labels are not available for evaluation, hence they have to be estimated from the clustering itself. In this paper, mean-shift clustering with PLDA score is applied in order to cluster short speaker segments represented as i-vectors. Different statistical parameters are then estimated on the clustered data and are used to train logistic regression to estimate ACP, ASP and K. It was found that logistic regression can be a good predictor of the actual ACP, ASP and K, and yields reasonable information regarding the clustering quality.


 DOI: 10.21437/Interspeech.2017-492

Cite as: Cohen, Y., Lapidot, I. (2017) Estimating Speaker Clustering Quality Using Logistic Regression. Proc. Interspeech 2017, 3577-3581, DOI: 10.21437/Interspeech.2017-492.


@inproceedings{Cohen2017,
  author={Yishai Cohen and Itshak Lapidot},
  title={Estimating Speaker Clustering Quality Using Logistic Regression},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={3577--3581},
  doi={10.21437/Interspeech.2017-492},
  url={http://dx.doi.org/10.21437/Interspeech.2017-492}
}