Interspeech'2005 - Eurospeech
This paper presents an unsupervised method for clustering spontaneous speech documents. The approach uses a hierarchical algorithm to automatically determine the number of clusters and a starting model for a subsequent iterative algorithm. We have evaluated this method on the Switchboard corpus and compared it to a set of supervised and other unsupervised methods. The results show that our method significantly outperforms the rest of the approaches.
Bibliographic reference. GonzÓlez, Edgar / Turmo, Jordi (2005): "Unsupervised clustering of spontaneous speech documents", In INTERSPEECH-2005, 609-612.