ISCA Archive Eurospeech 2001
ISCA Archive Eurospeech 2001

A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping

Bowen Zhou, John H. L. Hansen

In this paper, we propose a novel algorithm for rapid speaker adaptation based on our Structural Maximum Likelihood Eigenspace Mapping (SMLEM). The proposed method constructs a binary-tree structured hierarchical Speaker Independent (SI) eigenspace at different levels from well-trained SI system models, and then dynamically constructs a new set of speaker dependent (SD) eigenspaces at corresponding levels, according to the availability of incoming adaptation data. By mapping the mixture Gaussian components from a SI eigenspace to SD eigenspaces in a maximum likelihood manner, the SI models are adapted towards SD models (EM algorithm is used to derive the eigenspace bias). Compared with conventional MLLR, the proposed algorithm is both computationally cheaper and more effective when only a very small amount (from 5 to 15 seconds) of adaptation data is available. In our simulations using the DARPA WSJ Spoke3 corpus, an average of 10.5% relative reduction in WER was achieved over MLLR adaptation when using 5 seconds data for adaptation.


doi: 10.21437/Eurospeech.2001-316

Cite as: Zhou, B., Hansen, J.H.L. (2001) A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 1215-1218, doi: 10.21437/Eurospeech.2001-316

@inproceedings{zhou01_eurospeech,
  author={Bowen Zhou and John H. L. Hansen},
  title={{A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={1215--1218},
  doi={10.21437/Eurospeech.2001-316}
}