ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

A novel model-based pitch conversion method for Mandarin speech

Hsin-Te Hwang, Chen-Yu Chiang, Po-Yi Sung, Sin-Horng Chen

In this paper, a novel model-based pitch conversion method for Mandarin is presented and compared with other two conventional conversion methods, i.e. the mean/variance transformation approach and the GMM-based mapping approach. Syllable pitch contour is first quantized by 3rd order orthogonal expansion coefficients; then, the source and target speakersÂ’ prosodic models are constructed, respectively. Two mapping methods based on the prosodic model are presented. Objective tests confirmed that one of the proposed methods are superior the conventional methods. Some findings in informal listening tests and objective tests are worthwhile to further investigate.


doi: 10.21437/Interspeech.2009-495

Cite as: Hwang, H.-T., Chiang, C.-Y., Sung, P.-Y., Chen, S.-H. (2009) A novel model-based pitch conversion method for Mandarin speech. Proc. Interspeech 2009, 2643-2646, doi: 10.21437/Interspeech.2009-495

@inproceedings{hwang09_interspeech,
  author={Hsin-Te Hwang and Chen-Yu Chiang and Po-Yi Sung and Sin-Horng Chen},
  title={{A novel model-based pitch conversion method for Mandarin speech}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2643--2646},
  doi={10.21437/Interspeech.2009-495}
}