Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Efficiently Using Speaker Adaptation Data

Chengyi Zheng (1), Yonghong Yan (2)

(1) Center for Spoken Language Understanding, Oregon Graduate Institute, OR, USA
(2) Intel Corporation, USA

Transformation-based speaker adaptation techniques, such as Maximum Likelihood Linear Regression (MLLR) [1], require a large amount of adaptation data to robustly estimate the transform matrices. In this paper, we present a new adaptation scheme that adjusts the adaptation data according to feedback from the recognizer. By giving different weights to different parts of the adaptation data, the proposed scheme makes more efficient use of that data. Experiments on the WSJ 20K task show that this method achieves an additional 10% relative word error rate reduction in supervised adaptation and a 2% reduction in unsupervised adaptation compared with the conventional MLLR approach.
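
The idea of weighting parts of the adaptation data can be illustrated with a toy sketch. It is not the scheme evaluated in the paper: the abstract does not say how the weights are derived from recognizer feedback, so they are supplied here as hypothetical inputs. The sketch also assumes scalar features, unit variances, and hard frame-to-Gaussian alignments, under which the MLLR mean transform reduces to a weighted least-squares fit of `adapted_mean = a * mu + b`. The function name `weighted_mllr_1d` is made up for this illustration.

```python
def weighted_mllr_1d(frames, weights):
    """Estimate (a, b) such that the adapted mean is a * mu + b.

    frames  : list of (observation, aligned_gaussian_mean) pairs
    weights : one non-negative weight per frame; hypothetical values
              standing in for feedback from the recognizer

    Assumptions (not from the paper): scalar features, unit variances,
    hard alignments. Under these, the maximum likelihood estimate is
    the weighted least-squares solution of o ~ a * mu + b.
    """
    s_w = s_m = s_mm = s_o = s_om = 0.0
    for (o, mu), w in zip(frames, weights):
        s_w += w            # total weight
        s_m += w * mu       # weighted sum of means
        s_mm += w * mu * mu
        s_o += w * o        # weighted sum of observations
        s_om += w * o * mu

    # Solve the 2x2 normal equations:
    #   a * s_mm + b * s_m = s_om
    #   a * s_m  + b * s_w = s_o
    det = s_w * s_mm - s_m * s_m
    if abs(det) < 1e-12:
        raise ValueError("too few distinct means to estimate a transform")
    a = (s_w * s_om - s_m * s_o) / det
    b = (s_mm * s_o - s_m * s_om) / det
    return a, b


# Three frames consistent with o = 2 * mu + 1, plus one unreliable frame.
frames = [(1.0, 0.0), (3.0, 1.0), (5.0, 2.0), (10.0, 1.0)]

uniform = weighted_mllr_1d(frames, [1.0, 1.0, 1.0, 1.0])
downweighted = weighted_mllr_1d(frames, [1.0, 1.0, 1.0, 0.0])
print("uniform weights:     ", uniform)
print("unreliable frame = 0:", downweighted)
```

With uniform weights the unreliable frame pulls the estimated bias away from the value implied by the other three frames. Setting its weight to zero recovers the transform those frames agree on, which is the intuition behind weighting adaptation data unevenly.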

Reference

  1. C.L. Leggetter and P.C. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density HMMs, Computer Speech and Language, Vol. 9, pp. 171-185, 1995.



Bibliographic reference.  Zheng, Chengyi / Yan, Yonghong (2000): "Efficiently using speaker adaptation data", In ICSLP-2000, vol.4, 358-361.