INTERSPEECH 2006 - ICSLP
A trainable prosodic model called SFC (Superposition of Functional Contours), proposed by Holm and Bailly, is here confronted to German intonation. Training material is the publicly available Siemens Synthesis Corpus that provides spoken utterances for high-quality speech synthesis. We describe the labeling framework and first evaluation results that compares the original prosody of test sentences of this corpus with their prosodic rendering by the proposed model and state-of-the-art systems available on-line on the web.
Bibliographic reference. Bailly, Gérard / Gorisch, Jan (2006): "Generating German intonation with a trainable prosodic model", In INTERSPEECH-2006, paper 2017-Thu1FoP.3.