Sixth European Conference on Speech Communication and Technology
In this paper, a speaker adaptation method is presented which computes the speaker adapted model by a weighted sum of a set of speaker dependent models. The set of weights are estimated to maximize the likelihood of the adaptation data. Then a linguistic tree is constructed to cluster the mean vectors. The means in the same linguistic class share the same weight set, while the means in different classes use different weight set to compute the adapted model. Experiments show that with as little as 1-3 sentences a significant performance improvement is obtained. As more adaptation data is available, further improvement can be obtained.
Full Paper (PDF) Gnu-Zipped Postscript
Bibliographic reference. Feng, Liu / Che, Chi-wei / Yu, Peng / Wang, Zuoying (1999): "Linguistic tree based maximum likelihood model interpolation", In EUROSPEECH'99, 2511-2514.