15th Annual Conference of the International Speech Communication Association

September 14-18, 2014

Automatic Recognition of Attitudes in Video Blogs — Prosodic and Visual Feature Analysis

Noor Alhusna Madzlan, JingGuang Han, Francesca Bonin, Nick Campbell

Trinity College Dublin, Ireland

This paper reports a study of attitude manifestations in video blogs. We describe the manual annotation of speaker attitudes in a corpus of over 130 video blogs and present an analysis of prosodic and visual cues in relation to attitude states. We use machine learning techniques for the automatic prediction of attitudes from prosodic and visual features in video blogs and compare the performance of prosodic and visual feature sets.

Bibliographic reference. Madzlan, Noor Alhusna / Han, JingGuang / Bonin, Francesca / Campbell, Nick (2014): "Automatic recognition of attitudes in video blogs — prosodic and visual feature analysis", In INTERSPEECH-2014, 1826-1830.