ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Cross-language voice conversion based on eigenvoices

Malorie Charlier, Yamato Ohtani, Tomoki Toda, Alexis Moinet, Thierry Dutoit

This paper presents a novel cross-language voice conversion (VC) method based on eigenvoice conversion (EVC). Cross-language VC is a technique for converting voice quality between two speakers uttering different languages each other. In general, parallel data consisting of utterance pairs of those two speakers are not available. To deal with this problem, we apply EVC to cross-language VC. First, we train an eigenvoice GMM (EV-GMM) using many parallel data sets by a source speaker and many pre-stored other speakers who can utter the same language as the source speaker. And then, the conversion model between the source speaker and a target speaker who cannot utter the source speaker’s language is developed by adapting the EV-GMM using a few arbitrary sentences uttered by the target speaker in a different language. The experimental results demonstrate that the proposed method yields significant performance improvements in both speech quality and conversion accuracy for speaker individuality compared with a conventional cross-language VC method based on frame selection.

doi: 10.21437/Interspeech.2009-488

Cite as: Charlier, M., Ohtani, Y., Toda, T., Moinet, A., Dutoit, T. (2009) Cross-language voice conversion based on eigenvoices. Proc. Interspeech 2009, 1635-1638, doi: 10.21437/Interspeech.2009-488

  author={Malorie Charlier and Yamato Ohtani and Tomoki Toda and Alexis Moinet and Thierry Dutoit},
  title={{Cross-language voice conversion based on eigenvoices}},
  booktitle={Proc. Interspeech 2009},