Phonetics and Phonology of Speaking Styles: Reduction and Elaboration in Speech Communication

Barcelona, Catalonia, Spain
September 30 - October 2, 1991

        

Neglected Dimensions in Speech Synthesis

Björn Granström, Lennart Nord

Dept of Speech Communication & Music Acoustics, Royal Institute of Technology, KTH, Stockholm, Sweden
(authors' names in alphabetic order)

In traditional accounts on speech prosody, fundamental frequency, duration and intensity have been described as the most important attributes. Among these, intensity has attracted the least attention. In perceptual studies both F0 and duration has had an undisputable role in signalling prosodic categories but the role of intensity has been less clear. This has resulted in an emphasis on the former attributes in current speech synthesis schemes. We are in this study exploring the use of speech intensity and also other segmental correlates of prosody. Intensity has a dynamic aspect, discriminating emphasized and reduced stretches of speech. A more global aspect of intensity must be controlled when we try to model different speaking styles. Specifically, we have been trying to model the continuum from soft to loud speech.

Full Paper

Bibliographic reference.  Granström, Björn / Nord, Lennart (1991): "Neglected dimensions in speech synthesis", In PPoSpSt-1991, paper 027.