September 22-25, 1997
In this paper we show how a confusion matrix derived from phone identification experiments can be used to generate phone clusters automatically. These clusters can be applied when constructing triphone models to overcome the sparse data problem. Two techniques are presented: first a hierarchical clustering technique, then an open clustering technique. Both use mutual information, calculated on a probability distribution derived from the confusion matrix, as a measure of phone similarity. Sample results from each technique are presented.
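The abstract does not say how the probability distribution is derived from the confusion matrix or how phones are merged, so the following Python sketch is an illustration under stated assumptions and not the authors' method. It assumes that the confusion counts are normalised into a joint distribution over (spoken phone, recognised phone), and that a hierarchical clustering greedily merges the pair of phones whose merging loses the least mutual information. The function names and the toy counts are hypothetical. The open clustering technique is not sketched here.

```python
import math


def joint_distribution(confusion):
    """Normalise confusion counts into a joint distribution (an assumption)."""
    total = sum(sum(row) for row in confusion)
    return [[count / total for count in row] for row in confusion]


def mutual_information(joint):
    """Mutual information, in bits, between the row and column variables."""
    row_marginal = [sum(row) for row in joint]
    col_marginal = [sum(col) for col in zip(*joint)]
    mi = 0.0
    for i, row in enumerate(joint):
        for j, p in enumerate(row):
            if p > 0.0:  # zero cells contribute nothing
                mi += p * math.log2(p / (row_marginal[i] * col_marginal[j]))
    return mi


def merge_phones(confusion, a, b):
    """Return a new matrix in which phone b is folded into phone a."""
    n = len(confusion)
    summed = [row[:] for row in confusion]
    for j in range(n):  # add row b to row a
        summed[a][j] += summed[b][j]
    for i in range(n):  # add column b to column a
        summed[i][a] += summed[i][b]
    keep = [k for k in range(n) if k != b]
    return [[summed[i][j] for j in keep] for i in keep]


def hierarchical_clusters(confusion, labels):
    """Greedy agglomerative clustering; returns the merges in order."""
    clusters = [[label] for label in labels]
    matrix = [row[:] for row in confusion]
    merges = []
    while len(clusters) > 1:
        current = mutual_information(joint_distribution(matrix))
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                merged = merge_phones(matrix, a, b)
                loss = current - mutual_information(joint_distribution(merged))
                if best is None or loss < best[0]:
                    best = (loss, a, b)
        _, a, b = best
        merges.append((clusters[a], clusters[b]))
        matrix = merge_phones(matrix, a, b)
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return merges


# Hypothetical counts: rows are spoken phones, columns are recognised phones.
labels = ["m", "n", "s"]
confusion = [
    [40, 10, 0],
    [10, 40, 0],
    [0, 0, 50],
]
merges = hierarchical_clusters(confusion, labels)
```

In this toy matrix "m" and "n" are often confused with each other and never with "s", so merging them loses the least mutual information and they are clustered first. Other similarity measures built on the same distribution would slot into the same loop.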
Bibliographic reference. O'Boyle, Peter / Ming, Ji / Owens, Marie / Smith, F. Jack (1997): "From phone identification to phone clustering using mutual information", In EUROSPEECH-1997, 2391-2394.