Automatic Estimation of Perceived Sincerity from Spoken Language

Brandon M. Booth, Rahul Gupta, Pavlos Papadopoulos, Ruchir Travadi, Shrikanth S. Narayanan


Sincerity is important in everyday human communication and perception of genuineness can greatly affect emotions and outcomes in social interactions. In this paper, submitted for the INTERSPEECH 2016 Sincerity Challenge, we examine a corpus of six different types of apologetic utterances from a variety of English speakers articulated in different prosodic styles, and we rate the sincerity of each remark. Since the utterances and semantic meaning in the examined database are controlled, we focus on tone of voice by exploring a plethora of acoustic and paralinguistic features not present in the baseline model and how well they contribute to human assessment of sincerity. We show that these additional features improve the performance using the baseline model, and furthermore that conditioning learning models on the prosody of utterances boosts the prediction accuracy. Our best system outperforms the challenge baseline and in principle can generalize well to other corpora.


DOI: 10.21437/Interspeech.2016-1537

Cite as

Booth, B.M., Gupta, R., Papadopoulos, P., Travadi, R., Narayanan, S.S. (2016) Automatic Estimation of Perceived Sincerity from Spoken Language. Proc. Interspeech 2016, 2021-2025.

Bibtex
@inproceedings{Booth+2016,
author={Brandon M. Booth and Rahul Gupta and Pavlos Papadopoulos and Ruchir Travadi and Shrikanth S. Narayanan},
title={Automatic Estimation of Perceived Sincerity from Spoken Language},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-1537},
url={http://dx.doi.org/10.21437/Interspeech.2016-1537},
pages={2021--2025}
}