9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Agglomerative Hierarchical Speaker Clustering Using Incremental Gaussian Mixture Cluster Modeling

Kyu J. Han, Shrikanth S. Narayanan

University of Southern California, USA

This paper proposes a novel cluster modeling method for intercluster distance measurement within the framework of agglomerative hierarchical speaker clustering, namely, incremental Gaussian mixture cluster modeling. This method uses a single Gaussian distribution to model each initial cluster, but represents any newly merged cluster using a distribution whose pdf is the weighted sum of the pdf's of the respective model distributions for the clusters involved in the particular merging process. As a result, clusters are smoothly transitioned to be modeled by Gaussian mixtures whose components are incremented as merging recursions continue during clustering. The proposed method can overcome the limited cluster representation capability of conventional single Gaussian cluster modeling. Through experiments on various sets of initial clusters, it is demonstrated that our approach consequently improves the reliability of speaker clustering performance.

Full Paper

Bibliographic reference.  Han, Kyu J. / Narayanan, Shrikanth S. (2008): "Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling", In INTERSPEECH-2008, 20-23.