Speech parameters originating from voice source and vocal tract were analyzed to find acoustic correlates of dimensional descriptions of emotional states. To achieve this goal best, we adopted the Utsunomiya University Spoken Dialogue Database, which was designed for studies on paralinguistic information in expressive conversational speech. Analyses for four female and two male speakers showed: (i) Prosodic parameters were highly correlated especially with the activation dimension, (ii) The aperiodicity-related voice source parameter showed that breathy phonation was mainly used in unpleasant utterances for three females, (iii) Due to smiling facial expression, formant frequencies were higher in pleasant utterances for a female.
Bibliographic reference. Mori, Hiroki / Kasuya, Hideki (2007): "Voice source and vocal tract variations as cues to emotional states perceived from expressive conversational speech", In INTERSPEECH-2007, 102-105.