ISCA Archive ICSLP 2000

A new Japanese TTS system based on speech-prosody database and speech modification

Mitsuaki Isogai, Kimihito Tanaka, Satoshi Takano, Hideyuki Mizuno, Masanobu Abe, Sin’ya Nakajima

This paper describes a new Japanese text-to-speech (TTS) system that can produce highly natural and intelligible synthetic speech. The system's performance derives from three new approaches: (1) a prosody control algorithm that uses prosody data extracted from a natural speech database, together with a duration control algorithm based on statistical estimation; (2) a new type of synthesis unit that consists of a consonant with a following vowel chain, which suppresses unnatural sounds and acoustic discontinuities at concatenation points by preparing synthesis units with various lengths and various F0 contours; and (3) a speech modification algorithm with harmonics reconstruction. Listening tests were carried out to evaluate the new modules and the total performance of the system. The results confirm that the new modules work together effectively and that the system can produce high-quality synthesized speech.


Cite as: Isogai, M., Tanaka, K., Takano, S., Mizuno, H., Abe, M., Nakajima, S. (2000) A new Japanese TTS system based on speech-prosody database and speech modification. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 342-345

@inproceedings{isogai00_icslp,
  author={Mitsuaki Isogai and Kimihito Tanaka and Satoshi Takano and Hideyuki Mizuno and Masanobu Abe and Sin’ya Nakajima},
  title={{A new Japanese TTS system based on speech-prosody database and speech modification}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={3},
  pages={342--345}
}