7th International Conference on Spoken Language Processing
September 16-20, 2002
We present a speech synthesizer to seamlessly concatenate recorded and synthetic phrases to produce natural sounding and highly expressive speech. Not only the acoustic units, but also the F0 contours are seamlessly concatenated together from recorded and synthetic phrases. When mixed with recorded phrases, the F0 contours of synthetic phrases are generated adaptively relative to the actual surrounding F0 shapes of the recorded phrases. Although the intonation generation scheme was originally developed for unlimited speech synthesis, it is quite naturally extended to a hybrid intonation generation.
Bibliographic reference. Saito, Takashi / Sakamoto, Masaharu (2002): "Applying a hybrid intonation model to a seamless speech synthesizer", In ICSLP-2002, 165-168.