ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Unsupervised clustering of spontaneous speech documents

Edgar Gonzàlez, Jordi Turmo

This paper presents an unsupervised method for clustering spontaneous speech documents. The approach uses a hierarchical algorithm to automatically determine the number of clusters and a starting model for a subsequent iterative algorithm. We have evaluated this method on the Switchboard corpus and compared it to a set of supervised and other unsupervised methods. The results show that our method significantly outperforms the rest of the approaches.

doi: 10.21437/Interspeech.2005-63

Cite as: Gonzàlez, E., Turmo, J. (2005) Unsupervised clustering of spontaneous speech documents. Proc. Interspeech 2005, 609-612, doi: 10.21437/Interspeech.2005-63

  author={Edgar Gonzàlez and Jordi Turmo},
  title={{Unsupervised clustering of spontaneous speech documents}},
  booktitle={Proc. Interspeech 2005},