ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

A study of computation speed-UPS of the GMM-UBM speaker recognition system

Jack McLaughlin, Douglas A. Reynolds, Terry Gleason

The Gaussian Mixture Model Universal Background Model (GMM-UBM) speaker recognition system has demonstrated very high performance in several NIST evaluations. Such evaluations, however, are concerned only with classification accuracy. In many applications, system effectiveness must be evaluated in light of both accuracy and execution speed. We present here a number of techniques for decreasing computation. Using data from the Switchboard telephone speech corpus, we show that significant speed-ups can be obtained while sacrificing surprisingly little accuracy. We expect that these techniques, involving lowering model order as well as processing fewer speech frames, will apply equally well to other recognition systems.


doi: 10.21437/Eurospeech.1999-284

Cite as: McLaughlin, J., Reynolds, D.A., Gleason, T. (1999) A study of computation speed-UPS of the GMM-UBM speaker recognition system. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1215-1218, doi: 10.21437/Eurospeech.1999-284

@inproceedings{mclaughlin99_eurospeech,
  author={Jack McLaughlin and Douglas A. Reynolds and Terry Gleason},
  title={{A study of computation speed-UPS of the GMM-UBM speaker recognition system}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={1215--1218},
  doi={10.21437/Eurospeech.1999-284}
}