ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis

Yi-Jian Wu, Yoshihiko Nankaku, Keiichi Tokuda

A phone mapping-based method had been introduced for crosslingual speaker adaptation in HMM-based speech synthesis. In this paper, we continue to propose a state mapping based method for cross-lingual speaker adaptation. In this method, we firstly establish the state mapping between two voice models in source and target languages using Kullback-Leibler divergence (KLD). Based on the established mapping information, we introduce two approaches to conduct cross-lingual speaker adaptation, including data mapping and transform mapping approaches. From the experimental results, the state mapping based method outperformed the phone mapping based method. In addition, the data mapping approach achieved better speaker similarity, and the transform mapping approach achieved better speech quality after adaptation.


doi: 10.21437/Interspeech.2009-192

Cite as: Wu, Y.-J., Nankaku, Y., Tokuda, K. (2009) State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis. Proc. Interspeech 2009, 528-531, doi: 10.21437/Interspeech.2009-192

@inproceedings{wu09b_interspeech,
  author={Yi-Jian Wu and Yoshihiko Nankaku and Keiichi Tokuda},
  title={{State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={528--531},
  doi={10.21437/Interspeech.2009-192}
}