ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Japanese pitch conversion for voice morphing based on differential modeling

Ryuki Tachibana, Zhiwei Shuang, Masafumi Nishimura

In this paper, we convert the pitch contours predicted by a TTS system that models a source speaker to resemble the pitch contours of a target speaker. When the speaking styles of the speakers are very different, complex conversions such as adding or deleting pitch peaks may be required. Our method does the conversions by modeling the direct pitch features and differential pitch features at the same time based on linguistic features. The differential pitch features are calculated from matched pairs of source and target pitch values. We show experimental results in which the target speaker’s characteristics are successfully modeled based on a very limited training corpus. The proposed pitch conversion method stretches the possibilities of TTS customization for various speaking styles.


doi: 10.21437/Interspeech.2009-497

Cite as: Tachibana, R., Shuang, Z., Nishimura, M. (2009) Japanese pitch conversion for voice morphing based on differential modeling. Proc. Interspeech 2009, 2651-2654, doi: 10.21437/Interspeech.2009-497

@inproceedings{tachibana09_interspeech,
  author={Ryuki Tachibana and Zhiwei Shuang and Masafumi Nishimura},
  title={{Japanese pitch conversion for voice morphing based on differential modeling}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2651--2654},
  doi={10.21437/Interspeech.2009-497}
}