ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Efficiently using speaker adaptation data

Chengyi Zheng, Yonghong Yan

Transformation based speaker adaptation techniques, such as Maximum Likelihood Linear Regression (MLLR) [1] require a large amount of adaptation data to robustly estimate the transform matrices. In this paper, we present a new adaptation scheme that adjusts the adaptation data according to the feedback from recognizer. By giving different weights to different parts of the adaptation data, the proposed scheme can make use of the adaptation data more efficiently. Experiments on the WSJ 20K task show that this method achieved an additional 10% relative word error rate reduction in supervised adaptation and 2% reduction in unsupervised adaptation compared with conventional MLLR approach.

C.L. Leggetter and P.C. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density HMMs, Computer Speech and Language, Vol.9, pp. 171-185, 1995.

Cite as: Zheng, C., Yan, Y. (2000) Efficiently using speaker adaptation data. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 358-361

  author={Chengyi Zheng and Yonghong Yan},
  title={{Efficiently using speaker adaptation data}},
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 4, 358-361}