8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Voice Source and Vocal Tract Variations as Cues to Emotional States Perceived from Expressive Conversational Speech

Hiroki Mori (1), Hideki Kasuya (2)

(1) Utsunomiya University, Japan
(2) International University of Health & Welfare, Japan

Speech parameters originating from the voice source and the vocal tract were analyzed to find acoustic correlates of dimensional descriptions of emotional states. For this purpose, we adopted the Utsunomiya University Spoken Dialogue Database, which was designed for studies of paralinguistic information in expressive conversational speech. Analyses of four female and two male speakers showed that: (i) prosodic parameters were highly correlated with the emotional dimensions, especially the activation dimension; (ii) the aperiodicity-related voice source parameter indicated that breathy phonation was mainly used in unpleasant utterances by three of the female speakers; (iii) formant frequencies were higher in pleasant utterances for one female speaker, owing to a smiling facial expression.


Bibliographic reference. Mori, Hiroki / Kasuya, Hideki (2007): "Voice source and vocal tract variations as cues to emotional states perceived from expressive conversational speech", in INTERSPEECH-2007, 102-105.