ISCA Archive ICSLP 2000

Puretalk: a high quality Japanese text-to-speech system

Masayuki Yamada, Yasuo Okutani, Toshiaki Fukada, Takashi Aso, Yasuhiro Komori

This paper describes PureTalk, a high-quality Japanese text-to-speech (TTS) system. The system is similar to conventional diphone-based TTS using PSOLA, except that PureTalk employs the following novel techniques, which enable it to produce more intelligible and natural-sounding speech: 1) two-stage duration modeling based on a linear regression technique, 2) F0 contour modeling using polynomial segment models, 3) sophisticated waveform unit selection, and 4) efficient waveform compression designed for TTS systems. The results of a subjective listening test show that PureTalk achieves high quality under practical computation and memory requirements.


Cite as: Yamada, M., Okutani, Y., Fukada, T., Aso, T., Komori, Y. (2000) Puretalk: a high quality Japanese text-to-speech system. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 403-406

@inproceedings{yamada00_icslp,
  author={Masayuki Yamada and Yasuo Okutani and Toshiaki Fukada and Takashi Aso and Yasuhiro Komori},
  title={{Puretalk: a high quality Japanese text-to-speech system}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={2},
  pages={403--406}
}