Speaker Movement Correlates with Prosodic Indicators of Engagement

Rob Voigt, Robert J. Podesva, Dan Jurafsky


Recent research on multimodal prosody has begun to identify associations between discrete body movements and categorical acoustic prosodic events such as pitch accents and boundaries. We propose to generalize this work to understand more about continuous prosodic phenomena distributed over a phrase - like those indicative of speaker engagement - and how they covary with bodily movements. We introduce movement amplitude, a new vision-based metric for estimating continuous body movements over time from video by quantifying frame-to-frame visual changes. Application of this automatic metric to a collection of video monologues demonstrates that speakers move more during phrases in which their pitch and intensity are higher and more variable. These findings offer further evidence for the relation- ship between acoustic and visual prosody, and suggest a previously unreported quantitative connection between raw bodily movement and speaker engagement.


 DOI: 10.21437/SpeechProsody.2014-2

Cite as: Voigt, R., Podesva, R.J., Jurafsky, D. (2014) Speaker Movement Correlates with Prosodic Indicators of Engagement. Proc. 7th International Conference on Speech Prosody 2014, 70-74, DOI: 10.21437/SpeechProsody.2014-2.


@inproceedings{Voigt2014,
  author={Rob Voigt and Robert J. Podesva and Dan Jurafsky},
  title={{Speaker Movement Correlates with Prosodic Indicators of Engagement}},
  year=2014,
  booktitle={Proc. 7th International Conference on Speech Prosody 2014},
  pages={70--74},
  doi={10.21437/SpeechProsody.2014-2},
  url={http://dx.doi.org/10.21437/SpeechProsody.2014-2}
}