ISCA Archive ASR 2000
ISCA Archive ASR 2000

Structural maximum a posteriori linear regression for fast HMM adaptation

Olivier Siohan, Tor André Myrvoll, Chin-Hui Lee

Transformation-based model adaptation techniques like maximum likelihood linear regression (MLLR) rely on an accurate selection of the number of transformations for a given amount of adaptation data. If too many transformations are used, the transformation parameters may be poorly estimated, can overfit the adaptation data, and offer poor generalization. On the other hand, if the number of transformations is too small, the adapted models can only provide a moderate improvement over the baseline models. An adaptation approach should therefore be flexible in order to estimate reliably a large number of transformations when the amount of adaptation data is large, and a small number of transformations when only a few adaptation utterances are available. In this work, we show that a significant improvement can be obtained over MLLR with dynamic regression classes, first by replacing the maximum likelihood estimation criterion by a maximum a posteriori criterion, then by introducing a tree-structure for the prior densities of the transformations. The effectiveness of the proposed approach is illustrated on the Spoke3 1993 test set of the WSJ task. Using the same regression classes as MLLR, it is shown that the proposed approach reduces the risk of overfitting and exploit the adaptation data much more efficiently than MLLR, leading to a significant reduction of the word error rate with as little as one adaptation utterance.


Cite as: Siohan, O., Myrvoll, T.A., Lee, C.-H. (2000) Structural maximum a posteriori linear regression for fast HMM adaptation. Proc. ASR2000 - Automatic Speech Recognition: Challenges for the New Millenium, 120-127

@inproceedings{siohan00_asr,
  author={Olivier Siohan and Tor André Myrvoll and Chin-Hui Lee},
  title={{Structural maximum a posteriori linear regression for fast HMM adaptation}},
  year=2000,
  booktitle={Proc. ASR2000 - Automatic Speech Recognition: Challenges for the New Millenium},
  pages={120--127}
}