ISCA Archive ICSLP 2000

Selective training of HMMs by using two-stage clustering

Shoei Sato, Toru Imai, Hideki Tanaka, Akio Ando

This paper proposes a method of constructing acoustic models from training data clustered in two stages. In the first stage, training data from a target task are clustered, and GMMs are generated for each cluster. In the second stage, the GMMs are used to select training data from a large-scale database based on the GMM likelihood. An acoustic model is then adapted for each cluster by MAP estimation using the selected training data. In decoding, the best acoustic model is selected from all acoustic models based on the GMM likelihood computed over some initial frames of an input utterance. Broadcast news transcription experiments showed that the proposed models achieved a word error reduction of 20% and a processing time reduction of 22%, compared with a non-clustered model.
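The two likelihood-based selection steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the diagonal-covariance GMM representation, the average per-frame log-likelihood score, the fixed `threshold` for data selection, and all function names (`gmm_avg_log_likelihood`, `select_utterances`, `select_model`) are assumptions made for illustration, since the abstract does not specify them. Clustering and MAP adaptation are omitted.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_avg_log_likelihood(frames, gmm):
    """Average per-frame log-likelihood of frames under a GMM.

    gmm is a list of (weight, mean, var) components (assumed layout).
    """
    total = 0.0
    for x in frames:
        comp = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
        top = max(comp)  # log-sum-exp for numerical stability
        total += top + math.log(sum(math.exp(c - top) for c in comp))
    return total / len(frames)

def select_utterances(utterances, gmm, threshold):
    """Second stage (assumed criterion): keep database utterances whose
    average log-likelihood under the cluster GMM reaches the threshold."""
    return [u for u in utterances if gmm_avg_log_likelihood(u, gmm) >= threshold]

def select_model(frames, gmms, n_initial=10):
    """Decoding: index of the cluster whose GMM best explains the
    first n_initial frames of the input utterance."""
    head = frames[:n_initial]
    scores = [gmm_avg_log_likelihood(head, g) for g in gmms]
    return max(range(len(gmms)), key=scores.__getitem__)

# Toy two-dimensional data; the values are hypothetical.
gmm_a = [(1.0, [0.0, 0.0], [1.0, 1.0])]
gmm_b = [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [6.0, 6.0], [1.0, 1.0])]
utterance = [[4.8, 5.1], [5.9, 6.2], [5.2, 4.9]]

best = select_model(utterance, [gmm_a, gmm_b], n_initial=2)
kept = select_utterances([[[0.1, 0.0]], [[5.0, 5.0]]], gmm_a, threshold=-5.0)
```

In this sketch, `best` is the index of the acoustic model that would be used to decode the utterance, and `kept` holds the database utterances that would be passed to adaptation for the cluster represented by `gmm_a`.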


Cite as: Sato, S., Imai, T., Tanaka, H., Ando, A. (2000) Selective training of HMMs by using two-stage clustering. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 726-729

@inproceedings{sato00b_icslp,
  author={Shoei Sato and Toru Imai and Hideki Tanaka and Akio Ando},
  title={{Selective training of HMMs by using two-stage clustering}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 726-729}
}