10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Polyglot Speech Prosody Control

Harald Romsdorfer

ETH Zürich, Switzerland

Within a polyglot text-to-speech synthesis system, generating adequate prosody for mixed-lingual texts, sentences, or even words requires a polyglot prosody model that can switch seamlessly between languages and that uses the same voice for all languages. This paper presents the first polyglot prosody model that fulfills these requirements and that is constructed from independent monolingual prosody models. A perceptual evaluation showed that the synthetic polyglot prosody of about 82% of German and French mixed-lingual test sentences could not be distinguished from natural polyglot prosody.

Bibliographic reference. Romsdorfer, Harald (2009): "Polyglot speech prosody control", in INTERSPEECH-2009, 488-491.