ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Signature cluster model selection for incremental Gaussian mixture cluster modeling in agglomerative hierarchical speaker clustering

Kyu J. Han, Shrikanth S. Narayanan

Agglomerative hierarchical speaker clustering (AHSC) has been widely used for classifying speech data by speaker characteristics. Its bottom-up, one-way structure of merging the closest cluster pair at every recursion step, however, makes it difficult to recover from incorrect merging. Hence, making AHSC robust to incorrect merging is an important issue. In this paper we address this problem in the framework of AHSC based on incremental Gaussian mixture models, which we previously introduced for better representing variable cluster size. Specifically, to minimize contamination in cluster models by heterogeneous data, we select and keep updating a representative (or signature) model for each cluster during AHSC. Experiments on meeting speech excerpts (4 hours total) verify that the proposed approach improves average speaker clustering performance by approximately 20% (relative).


doi: 10.21437/Interspeech.2009-671

Cite as: Han, K.J., Narayanan, S.S. (2009) Signature cluster model selection for incremental Gaussian mixture cluster modeling in agglomerative hierarchical speaker clustering. Proc. Interspeech 2009, 2547-2550, doi: 10.21437/Interspeech.2009-671

@inproceedings{han09b_interspeech,
  author={Kyu J. Han and Shrikanth S. Narayanan},
  title={{Signature cluster model selection for incremental Gaussian mixture cluster modeling in agglomerative hierarchical speaker clustering}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2547--2550},
  doi={10.21437/Interspeech.2009-671}
}