Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

Regression Transformation of Prior Means for Speaker Adaptation

Guoqiang Li, Limin Du, Ziqiang Hou

Institute of Acoustics, Chinese Academy of Sciences, Beijing, China

Maximum a posteriori adaptation method combines the prior knowledge with adaptation data from a new speaker, which has a nice asymptotical property, but has a slow adaptation rate for not modifying unseen models. In a strictly Bayesian approach, prior parameters are assumed known, based on common or subjective knowledge. But a practical solution is to adopt an empirical Bayesian approach, where the prior parameters are estimated directly from training speech data itself. So there is a problem of mismatches between training and testing conditions. In this paper we propose a prior parameter transformation (PPT) adaptation approach that transforms the prior parameters to be more representative of the new speaker. It can influence unseen models by tying prior parameter transformations across different models according to amount of adaptation data available. Based on the improved prior information better model parameters can be obtained even with small amount of adaptation data.

Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Li, Guoqiang / Du, Limin / Hou, Ziqiang (1999): "Regression transformation of prior means for speaker adaptation", In EUROSPEECH'99, 2507-2510.