13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Active Learning by Sparse Instance Tracking and Classifier Confidence in Acoustic Emotion Recognition

Zixing Zhang, Björn Schuller

Institute for Human-Machine Communication, Technische Universität München, Germany

Data scarcity is an ever crucial problem in the field of acoustic emotion recognition. How to get the most informative data from a huge amount of data by least human work and at the same time to obtain the highest performance is quite important. In this paper, we propose and investigate two active learning strategies in acoustic emotion recognition: Based on sparse instances or based on classifier confidence scores. The first strategy focuses on the unbalanced problem of binary or multiple classes. And the later strategy pays more attention on clearing up the boundary confusion between different classes. Our experimental results show that by using active learning based on sparse instances or based on classifier confidence, the amount of transcribed data needed is significantly reduced and the unweigted accuracy boosts greatly as well.

Index Terms: Active Learning, Acoustic Emotion Recognition, Sparse Instances, Confidence Scores

Full Paper

Bibliographic reference.  Zhang, Zixing / Schuller, Björn (2012): "Active learning by sparse instance tracking and classifier confidence in acoustic emotion recognition", In INTERSPEECH-2012, 362-365.