ISCA Archive Eurospeech 2001

What is the best type of prior distribution for EMAP speaker adaptation?

Patrick Kenny, Gilles Boulianne, Pierre Dumouchel

There are two types of prior distribution that can be viewed as natural for extended MAP (or EMAP) speaker adaptation. One arises from modeling the correlations between speakers (assumed to be constant across HMM Gaussians) and the other from modeling the correlations between HMM Gaussians (assumed to be constant across speakers). In this paper we present new results establishing the usefulness of correlations of the first type for speaker adaptation and we outline a tensor product construction which enables both types of correlation to be integrated in a common mathematical framework. We also present the results of some experiments which suggest that the two types of correlation are equally effective for speaker adaptation and that there is no incremental improvement to be gained by modeling both of them simultaneously.


doi: 10.21437/Eurospeech.2001-314

Cite as: Kenny, P., Boulianne, G., Dumouchel, P. (2001) What is the best type of prior distribution for EMAP speaker adaptation? Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 1207-1210, doi: 10.21437/Eurospeech.2001-314

@inproceedings{kenny01_eurospeech,
  author={Patrick Kenny and Gilles Boulianne and Pierre Dumouchel},
  title={{What is the best type of prior distribution for EMAP speaker adaptation?}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={1207--1210},
  doi={10.21437/Eurospeech.2001-314}
}