ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Bilinear transformation space-based maximum likelihood linear regression frameworks

Hwa Jeon Song, Yongwon Jeong, Hyung Soon Kim

This paper proposes two types of bilinear transformation spacebased speaker adaptation frameworks. In training session, transformation matrices for speakers are decomposed into the style factor for speakersÂ’ characteristics and orthonormal basis of eigenvectors to control dimensionality of the canonical model by the singular value decomposition-based algorithm. In adaptation session, the style factor of a new speaker is estimated, depending on what kind of proposed framework is used. At the same time, the dimensionality of the canonical model can be reduced by the orthonormal basis from training. Moreover, both maximum likelihood linear regression (MLLR) and eigenspace-based MLLR are identified as special cases of our proposed methods. Experimental results show that the proposed methods are much more effective and versatile than other methods


doi: 10.21437/Interspeech.2009-197

Cite as: Song, H.J., Jeong, Y., Kim, H.S. (2009) Bilinear transformation space-based maximum likelihood linear regression frameworks. Proc. Interspeech 2009, 548-551, doi: 10.21437/Interspeech.2009-197

@inproceedings{song09_interspeech,
  author={Hwa Jeon Song and Yongwon Jeong and Hyung Soon Kim},
  title={{Bilinear transformation space-based maximum likelihood linear regression frameworks}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={548--551},
  doi={10.21437/Interspeech.2009-197}
}