ISCA Archive SpeechProsody 2010
ISCA Archive SpeechProsody 2010

Acoustic markers of emotions based on voice physiology

Sona Patel, Klaus R. Scherer, Johan Sundberg, Eva Björkner

Acoustic models of emotions may benefit from considering the underlying voice production mechanism. This study sought to describe emotional expressions according to physiological variations measured from the inverse-filtered glottal waveform in addition to standard parameter extraction. An acoustic analysis was performed on a subset of the /a/ vowels within the GEMEP database (10 speakers, 5 emotions). Of the 12 acoustic features computed, repeated measures ANOVA showed significant main effects for 11 parameters. Subsequent principal components analysis revealed the three components that explain acoustic variations due to emotion, including “tension” (CQ, H1-H2, MFDR, LTAS) “perturbation” (jitter, shimmer, HNR), and “voicing” (fundamental frequency).

Index Terms: emotion, vocal expression, acoustic cues, voice quality, physiology, glottal waveform, affect bursts

Cite as: Patel, S., Scherer, K.R., Sundberg, J., Björkner, E. (2010) Acoustic markers of emotions based on voice physiology. Proc. Speech Prosody 2010, paper 865

  author={Sona Patel and Klaus R. Scherer and Johan Sundberg and Eva Björkner},
  title={{Acoustic markers of emotions based on voice physiology}},
  booktitle={Proc. Speech Prosody 2010},
  pages={paper 865}