ISCA Archive ICSLP 1998
ISCA Archive ICSLP 1998

On a pitch alteration technique in excited cepstral spectrum for high quality TTS

JongDeuk Kim, SeongJoon Baek, MyungJin Bae

In the area of the speech synthesis techniques, the waveform coding methods maintain the intelligibility and naturalness of synthetic speech. In order to apply the waveform coding or hybrid coding techniques to synthesis by rule, we must be able to alter the pitches of synthetic speech. In this paper, we propose a new pitch alteration method that minimizes the spectrum distortion by using the behavior of cepstrum. This method splits the spectrum of speech signal into excitation spectrum and formant spectrum and transforms the excitation spectrum into cepstrum domain. The pitch of excitation cepstrum is altered by zero insertion or zero deletion and the pitch altered spectrum is reconstructed in spectrum domain. As a result of performance test, the average spectrum distortion was below 2.29% while that of conventional method is 2.47%.


doi: 10.21437/ICSLP.1998-108

Cite as: Kim, J., Baek, S., Bae, M. (1998) On a pitch alteration technique in excited cepstral spectrum for high quality TTS. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 1020, doi: 10.21437/ICSLP.1998-108

@inproceedings{kim98_icslp,
  author={JongDeuk Kim and SeongJoon Baek and MyungJin Bae},
  title={{On a pitch alteration technique in excited cepstral spectrum for high quality TTS}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 1020},
  doi={10.21437/ICSLP.1998-108}
}