ISCA Archive Interspeech 2008

Weighted segmental k-means initialization for SOM-based speaker clustering

Oshry Ben-Harush, Itshak Lapidot, Hugo Guterman

A new approach for the initial assignment of data in a speaker clustering application is presented. This approach employs the Weighted Segmental K-Means clustering algorithm prior to competitive-based learning. The clustering system relies on Self-Organizing Maps (SOM) for speaker modeling and likelihood estimation. Performance is evaluated on 108 two-speaker conversations taken from the LDC CALLHOME American English Speech corpus using the NIST criterion, and shows an improvement of approximately 48% in Cluster Error Rate (CER) relative to the randomly initialized clustering system. The number of iterations is also reduced significantly, which contributes to both the speed and the efficiency of the clustering system.


doi: 10.21437/Interspeech.2008-4

Cite as: Ben-Harush, O., Lapidot, I., Guterman, H. (2008) Weighted segmental k-means initialization for SOM-based speaker clustering. Proc. Interspeech 2008, 24-27, doi: 10.21437/Interspeech.2008-4

@inproceedings{benharush08_interspeech,
  author={Oshry Ben-Harush and Itshak Lapidot and Hugo Guterman},
  title={{Weighted segmental k-means initialization for SOM-based speaker clustering}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={24--27},
  doi={10.21437/Interspeech.2008-4}
}