7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

Applying a Hybrid Intonation Model to a Seamless Speech Synthesizer

Takashi Saito, Masaharu Sakamoto

IBM Japan Ltd., Japan

We present a speech synthesizer that seamlessly concatenates recorded and synthetic phrases to produce natural-sounding, highly expressive speech. Not only the acoustic units but also the F0 contours are concatenated seamlessly across recorded and synthetic phrases. When synthetic phrases are mixed with recorded phrases, their F0 contours are generated adaptively, relative to the actual F0 shapes of the surrounding recorded phrases. Although the intonation generation scheme was originally developed for unlimited speech synthesis, it extends naturally to hybrid intonation generation.
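
To make the idea of adapting a synthetic F0 contour to its recorded neighbors concrete, here is a minimal sketch. It is not the authors' intonation model, which the abstract does not specify. It assumes a simple stand-in technique: a log-domain offset, interpolated linearly across the phrase, so that the synthetic contour meets the boundary F0 values of the adjacent recorded phrases. The function name `adapt_f0` and its parameters are hypothetical.

```python
import math


def adapt_f0(synthetic_f0, left_target=None, right_target=None):
    """Shift a synthetic F0 contour (Hz) so its endpoints meet the
    boundary F0 values of neighboring recorded phrases.

    Illustrative assumption only: a linearly interpolated log-F0 offset.
    left_target / right_target are the F0 values (Hz) at the edges of the
    preceding / following recorded phrase, or None if there is no neighbor.
    """
    n = len(synthetic_f0)
    if n == 0:
        return []
    if left_target is None and right_target is None:
        # No recorded neighbors: keep the contour unchanged.
        return list(synthetic_f0)

    log_f0 = [math.log(f) for f in synthetic_f0]

    # Offsets needed at each edge, in the log domain.
    left_off = None if left_target is None else math.log(left_target) - log_f0[0]
    right_off = None if right_target is None else math.log(right_target) - log_f0[-1]

    # With only one neighbor, apply that edge's offset uniformly.
    if left_off is None:
        left_off = right_off
    if right_off is None:
        right_off = left_off

    adapted = []
    for i, lf in enumerate(log_f0):
        # Interpolation weight: 0 at the left edge, 1 at the right edge.
        w = i / (n - 1) if n > 1 else 0.0
        adapted.append(math.exp(lf + (1.0 - w) * left_off + w * right_off))
    return adapted


if __name__ == "__main__":
    contour = [100.0, 120.0, 110.0]
    print(adapt_f0(contour, left_target=130.0, right_target=90.0))
```

Working in the log domain keeps the adjustment proportional to pitch, so the internal shape of the synthetic contour is preserved while its edges are pulled toward the recorded context.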


Bibliographic reference. Saito, Takashi / Sakamoto, Masaharu (2002): "Applying a hybrid intonation model to a seamless speech synthesizer", in ICSLP-2002, 165-168.