ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Using maximum likelihood linear regression for segment clustering and speaker identification

Michiel Bacchiani

Many adaptation scenarios rely on clustering of either the test or training data. Although consistency between the clustering and adaptation objective functions is desired, most previous approaches have not implemented such consistency. This paper shows that the statistics used in Maximum Likelihood Linear Regression (MLLR) adaptation are sucient to cluster data with a consistent Maximum Likelihood (ML) criterion. In addition, as the algorithm uses the same statistics for both adaptation and clustering, it is computationally ecient. Clustering experiments contrasting the performance of this algorithm with the widely used text independent Gaussian mixture model approach show increased adaptation likelihoods and consistency of within-cluster speaker identity. In a speaker identification experiment the adaptation-based scoring showed improved classi fication performance compared to the mixture model-based scoring.


Cite as: Bacchiani, M. (2000) Using maximum likelihood linear regression for segment clustering and speaker identification. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 536-539

@inproceedings{bacchiani00_icslp,
  author={Michiel Bacchiani},
  title={{Using maximum likelihood linear regression for segment clustering and speaker identification}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 4, 536-539}
}