8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003


SOM as Likelihood Estimator for Speaker Clustering

Itshak Lapidot

IDIAP, Switzerland

A new approach is presented for clustering the speakers from unlabeled and unsegmented conversation, when the number of speakers is unknown. In this approach, Self-Organizing-Map (SOM) is used as likelihood estimators for speaker model. For estimation of the number of clusters the Bayesian Information Criterion (BIC) is applied. This approach was tested on the NIST 1996 HUB-4 evaluation test in terms of speaker and cluster purities. Results indicate that the combined SOM-BIC approach can lead to better clustering results than the baseline system.

Full Paper

Bibliographic reference.  Lapidot, Itshak (2003): "SOM as likelihood estimator for speaker clustering", In EUROSPEECH-2003, 3001-3004.