ISCA Archive PPST 1991
Neglected dimensions in speech synthesis

Björn Granström, Lennart Nord

In traditional accounts of speech prosody, fundamental frequency (F0), duration and intensity have been described as the most important attributes. Among these, intensity has attracted the least attention. In perceptual studies, both F0 and duration have had an indisputable role in signalling prosodic categories, but the role of intensity has been less clear. This has resulted in an emphasis on the former two attributes in current speech synthesis schemes. In this study we explore the use of speech intensity and also other segmental correlates of prosody. Intensity has a dynamic aspect, discriminating emphasized and reduced stretches of speech. A more global aspect of intensity must be controlled when we try to model different speaking styles. Specifically, we have been trying to model the continuum from soft to loud speech.

Cite as: Granström, B., Nord, L. (1991) Neglected dimensions in speech synthesis. Proc. ESCA Workshop on Phonetics and Phonology of Speaking Styles, paper 027

  author={Björn Granström and Lennart Nord},
  title={{Neglected dimensions in speech synthesis}},
  booktitle={Proc. ESCA Workshop on Phonetics and Phonology of Speaking Styles},
  pages={paper 027}