ISCA Archive ICSLP 1994
ISCA Archive ICSLP 1994

A novel segment-concatenation algorithm for a cepstrum-based synthesizer

Yoshinori Shiga, Yoshiyuki Hara, Tsuneo Nitta

We propose a segment-concatenation algorithm which reduce perceived distortion caused by segment concatenation for a segment-based speech synthesizer. This algorithm concatenates six types of phonetic segments along the transient part of speech rather than the steady part, where humans have a keen sense of spectral distortion. This concatenation method enables a segment-based synthesizer to produce a smooth sound with comparatively small required storage space for the segments. We apply the algorithm to a rule-based, cepstrum-based speech synthesizer for English words. We evaluate the intelligibility of the synthetic speech through the Modified Rhyme Test (MRT)[1]. The result proved that the speech has a high intelligibility ratio of 90 percent.


Cite as: Shiga, Y., Hara, Y., Nitta, T. (1994) A novel segment-concatenation algorithm for a cepstrum-based synthesizer. Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 1783-1786

@inproceedings{shiga94_icslp,
  author={Yoshinori Shiga and Yoshiyuki Hara and Tsuneo Nitta},
  title={{A novel segment-concatenation algorithm for a cepstrum-based synthesizer}},
  year=1994,
  booktitle={Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994)},
  pages={1783--1786}
}