ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

A quantitative description of German prosody offering symbolic labels as a by-product

Hansjörg Mixdorff, Hiroya Fujisaki

The prosodic quality of a text-to-speech system is important for the intellegibility and perceived naturalness of synthetic speech. In earlier works the author developed a linguistically motivated model of German intonation based on the quantitative Fujisaki model of the production process of F0. The current paper compares results yielded by automatic Fujisaki modeling with a GToBI-style anotation. On the accent level, a good correlation between tone labels and accent commands can be observed. On the phrase level, most level 3 and 4 break index boundaries are aligned with phrase commands whereas lower level boundaries are presumably marked with durational cues. Subsequently a regression model of syllable duration is introduced which permits to decompose the measured duration contour into an extrinsic and an intrinsic component.


Cite as: Mixdorff, H., Fujisaki, H. (2000) A quantitative description of German prosody offering symbolic labels as a by-product. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 98-101

@inproceedings{mixdorff00_icslp,
  author={Hansjörg Mixdorff and Hiroya Fujisaki},
  title={{A quantitative description of German prosody offering symbolic labels as a by-product}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 2, 98-101}
}