14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Active Learning by Label Uncertainty for Acoustic Emotion Recognition

Zixing Zhang (1), Jun Deng (1), Erik Marchi (1), Björn Schuller (2)

(1) Technische Universität München, Germany
(2) Universität Passau, Germany

Speech data is in principle available in large amounts for the training of acoustic emotion recognisers. However, emotional labelling is usually not given and the distribution is heavily unbalanced, as most data is erather neutralf than truly eemotionalf. In the ehay stackf of speech data, Active Learning automatically identifies the eneedlesf, i.e., the more informative instances to reduce human labelling effort when building a classifier, e.g., for acoustic emotion recognition. The critical issue thus is the determination and quantification of informativeness. To this end, we suggest to exploit the reliability of the usual ambiguity of emotional labels, i.e., we propose a novel approach based on label uncertainty. By building a certainty model and predicting the candidate instances, informativeness is thus based on labeller agreement. In addition, we consider class sparseness. The results of extensive test runs under well standardised conditions show the method's great potential in reducing labelling costs while boosting performance.

Full Paper

Bibliographic reference.  Zhang, Zixing / Deng, Jun / Marchi, Erik / Schuller, Björn (2013): "Active learning by label uncertainty for acoustic emotion recognition", In INTERSPEECH-2013, 2856-2860.