Interspeech'2005 - Eurospeech
In this paper, we advocate for the usage of word-level pitch features for detecting user emotional states during spoken tutoring dialogues. Prior research has primarily focused on the use of turnlevel features as predictors. We compute pitch features at the word level and resolve the problem of combining multiple features per turn using a word-level emotion model. Even under a very simple word-level emotion model, our results show an improvement in prediction using word-level features over using turn-level features. We find that the advantage of word-level features lies in a better prediction of longer turns.
Bibliographic reference. Rotaru, Mihai / Litman, Diane J. (2005): "Using word-level pitch features to better predict student emotions during spoken tutoring dialogues", In INTERSPEECH-2005, 881-884.