ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies

Martin Wöllmer, Florian Eyben, Stephan Reiter, Björn Schuller, Cate Cox, Ellen Douglas-Cowie, Roddy Cowie

Class based emotion recognition from speech, as performed in most works up to now, entails many restrictions for practical applications. Human emotion is a continuum and an automatic emotion recognition system must be able to recognise it as such. We present a novel approach for continuous emotion recognition based on Long Short-Term Memory Recurrent Neural Networks which include modelling of long-range dependencies between observations and thus outperform techniques like Support-Vector Regression. Transferring the innovative concept of additionally modelling emotional history to the classification of discrete levels for the emotional dimensions "valence" and "activation" we also apply Conditional Random Fields which prevail over the commonly used Support-Vector Machines. Experiments conducted on data that was recorded while humans interacted with a Sensitive Artificial Listener prove that for activation the derived classifiers perform as well as human annotators.


doi: 10.21437/Interspeech.2008-192

Cite as: Wöllmer, M., Eyben, F., Reiter, S., Schuller, B., Cox, C., Douglas-Cowie, E., Cowie, R. (2008) Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies. Proc. Interspeech 2008, 597-600, doi: 10.21437/Interspeech.2008-192

@inproceedings{wollmer08_interspeech,
  author={Martin Wöllmer and Florian Eyben and Stephan Reiter and Björn Schuller and Cate Cox and Ellen Douglas-Cowie and Roddy Cowie},
  title={{Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={597--600},
  doi={10.21437/Interspeech.2008-192}
}