ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Polyglot speech prosody control

Harald Romsdorfer

Within a polyglot text-to-speech synthesis system, the generation of an adequate prosody for mixed-lingual texts, sentences, or even words, requires a polyglot prosody model that is able to seamlessly switch between languages and that applies the same voice for all languages. This paper presents the first polyglot prosody model that fulfills these requirements and that is constructed from independent monolingual prosody models. A perceptual evaluation showed that the synthetic polyglot prosody of about 82% of German and French mixed-lingual test sentences cannot be distinguished from natural polyglot prosody.

doi: 10.21437/Interspeech.2009-182

Cite as: Romsdorfer, H. (2009) Polyglot speech prosody control. Proc. Interspeech 2009, 488-491, doi: 10.21437/Interspeech.2009-182

  author={Harald Romsdorfer},
  title={{Polyglot speech prosody control}},
  booktitle={Proc. Interspeech 2009},