Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Unsupervised Clustering of Spontaneous Speech Documents

Edgar González, Jordi Turmo

Universitat Politècnica de Catalunya, Barcelona, Spain

This paper presents an unsupervised method for clustering spontaneous speech documents. The approach uses a hierarchical algorithm to automatically determine the number of clusters and a starting model for a subsequent iterative algorithm. We have evaluated this method on the Switchboard corpus and compared it to a set of supervised and other unsupervised methods. The results show that our method significantly outperforms the rest of the approaches.
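The two-stage idea in the abstract (a hierarchical pass that fixes the number of clusters and supplies a starting model, followed by an iterative pass) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: documents are represented as plain term-weight vectors, the hierarchical stage uses centroid linkage with cosine similarity and stops at a hypothetical `threshold`, and the iterative stage is a k-means-style reassignment. The function names are likewise hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def agglomerate(docs, threshold):
    """Stage 1 (assumed form): merge the two most similar clusters
    until no pair of centroids is at least `threshold` similar.
    The number of clusters left is the estimate of k."""
    clusters = [[i] for i in range(len(docs))]
    while len(clusters) > 1:
        best, pair = -1.0, None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                ci = centroid([docs[k] for k in clusters[i]])
                cj = centroid([docs[k] for k in clusters[j]])
                s = cosine(ci, cj)
                if s > best:
                    best, pair = s, (i, j)
        if best < threshold:
            break
        i, j = pair
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters

def refine(docs, centroids, max_iter=20):
    """Stage 2 (assumed form): reassign each document to its most
    similar centroid and recompute centroids until labels are stable."""
    labels = None
    for _ in range(max_iter):
        new_labels = [
            max(range(len(centroids)), key=lambda c: cosine(d, centroids[c]))
            for d in docs
        ]
        if new_labels == labels:
            break
        labels = new_labels
        for c in range(len(centroids)):
            members = [docs[i] for i, lab in enumerate(labels) if lab == c]
            if members:  # keep the old centroid if a cluster empties
                centroids[c] = centroid(members)
    return labels

def cluster_documents(docs, threshold=0.5):
    """Run the hierarchical stage, then seed the iterative stage with it."""
    initial = agglomerate(docs, threshold)
    seeds = [centroid([docs[i] for i in members]) for members in initial]
    return refine(docs, seeds)

# Toy term-weight vectors, invented for illustration only.
docs = [[1, 0, 0], [0.9, 0.1, 0], [0, 1, 0], [0, 0.9, 0.1], [0, 0, 1]]
labels = cluster_documents(docs, threshold=0.5)
```

In this sketch the hierarchical stage only decides how many clusters there are and where the iterative stage starts; the final assignment comes from the refinement loop, which may move documents between the initial clusters.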

Bibliographic reference. González, Edgar / Turmo, Jordi (2005): "Unsupervised clustering of spontaneous speech documents", in INTERSPEECH-2005, 609-612.