ISCA Archive Interspeech 2013

Active learning for dimensional speech emotion recognition

Wenjing Han, Haifeng Li, Huabin Ruan, Lin Ma, Jiayin Sun, Björn Schuller

State-of-the-art dimensional speech emotion recognition systems are trained on continuously labelled instances. The data labelling process is labour-intensive and time-consuming. In this paper, we propose applying active learning to reduce this effort: the unlabelled instances are evaluated automatically, and only the most informative ones, as ranked by an informativeness measure, are passed to a human for labelling. Specifically, we estimate the informativeness of each unlabelled instance from a binary-classification confidence score, that is, the confidence with which its emotion is predicted to be negative or positive on a given emotional dimension. For verification, we consider a pool-based and a stream-based scenario, run on part of the continuous AVEC 2012 task, to demonstrate the feasibility of the proposed approach in practice. The results show that, in both scenarios, our approach requires significantly fewer human-labelled instances than passive learning to reach a given performance.
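
The selection step described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: it assumes the confidence of a binary (negative vs. positive) prediction is the absolute value of a signed classifier score, so that instances near zero count as the most informative. The function names, the example scores, and the threshold value are all hypothetical.

```python
def informativeness(score):
    """Assumed measure: the closer a signed score is to 0, the less
    confident the negative/positive decision, hence the more informative."""
    return -abs(score)


def select_pool(scores, k):
    """Pool-based scenario: rank the whole unlabelled pool and return the
    indices of the k most informative (least confident) instances."""
    ranked = sorted(range(len(scores)),
                    key=lambda i: informativeness(scores[i]),
                    reverse=True)
    return ranked[:k]


def select_stream(scores, threshold):
    """Stream-based scenario: instances arrive one at a time, and each is
    queried only if its confidence falls below a fixed threshold."""
    for i, score in enumerate(scores):
        if abs(score) < threshold:
            yield i


# Hypothetical signed scores for five unlabelled instances on one dimension.
scores = [1.8, -0.1, 0.4, -2.3, 0.05]

pool_picks = select_pool(scores, k=2)                       # [4, 1]
stream_picks = list(select_stream(scores, threshold=0.5))   # [1, 2, 4]
print(pool_picks, stream_picks)
```

In both scenarios the selected instances would be sent to a human annotator and the model retrained on the enlarged labelled set. The two differ only in whether the whole pool is ranked at once or each instance is judged as it arrives.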


doi: 10.21437/Interspeech.2013-247

Cite as: Han, W., Li, H., Ruan, H., Ma, L., Sun, J., Schuller, B. (2013) Active learning for dimensional speech emotion recognition. Proc. Interspeech 2013, 2841-2845, doi: 10.21437/Interspeech.2013-247

@inproceedings{han13_interspeech,
  author={Wenjing Han and Haifeng Li and Huabin Ruan and Lin Ma and Jiayin Sun and Björn Schuller},
  title={{Active learning for dimensional speech emotion recognition}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2841--2845},
  doi={10.21437/Interspeech.2013-247}
}