ISCA Archive Interspeech 2009

Online model adaptation for voice conversion using model-based speech synthesis techniques

Dalei Wu, Baojie Li, Hui Jiang, Qian-Jie Fu

In this paper, we present a novel voice conversion method using model-based speech synthesis, suited to applications where no prior knowledge or training data is available from the source speaker. In the proposed method, training data from a target speaker is used to build a GMM-based speech model, and voice conversion is then performed on each utterance from the source speaker according to this pre-trained target speaker model. To reduce the mismatch between source and target speakers, we propose online model adaptation based on maximum likelihood linear regression (MLLR), which improves model selection accuracy. Objective and subjective evaluations suggest that the proposed methods are effective in generating acceptable voice quality for voice conversion, even without training data from source speakers.
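The abstract does not spell out the conversion procedure, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes a diagonal-covariance GMM for the target speaker, an MLLR-style affine transform of the means (`A`, `b`) that has already been estimated from the source utterance, hard selection of the most likely component per frame, and output of the unadapted target mean for the selected component. All function names, parameters, and these design choices are hypothetical.

```python
import numpy as np

def log_gauss_diag(X, means, variances):
    """Log density of each frame under each diagonal Gaussian; shape (T, K)."""
    diff = X[:, None, :] - means[None, :, :]            # (T, K, D)
    return -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                   + np.sum(np.log(2.0 * np.pi * variances), axis=1))

def select_components(X, weights, means, variances):
    """Index of the highest-scoring mixture component for each frame."""
    scores = np.log(weights) + log_gauss_diag(X, means, variances)
    return np.argmax(scores, axis=1)

def adapt_means(means, A, b):
    """MLLR-style mean adaptation: mu_k -> A @ mu_k + b for every component."""
    return means @ A.T + b

def convert(X, weights, means, variances, A, b):
    """Assumed pipeline: select with adapted means, emit original target means."""
    adapted = adapt_means(means, A, b)                  # move model toward source
    idx = select_components(X, weights, adapted, variances)
    return means[idx], idx                              # frames in target space
```

In this sketch the adapted means are used only to decide which component a source frame belongs to, while the output is drawn from the original target model so that it retains the target speaker's characteristics. Estimating `A` and `b` online from the source utterance, which is the core of MLLR, is left out here.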


doi: 10.21437/Interspeech.2009-490

Cite as: Wu, D., Li, B., Jiang, H., Fu, Q.-J. (2009) Online model adaptation for voice conversion using model-based speech synthesis techniques. Proc. Interspeech 2009, 1643-1646, doi: 10.21437/Interspeech.2009-490

@inproceedings{wu09d_interspeech,
  author={Dalei Wu and Baojie Li and Hui Jiang and Qian-Jie Fu},
  title={{Online model adaptation for voice conversion using model-based speech synthesis techniques}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1643--1646},
  doi={10.21437/Interspeech.2009-490}
}