12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Weighted Ordered Classes - Nearest Neighbors: A New Framework for Automatic Emotion Recognition from Speech

Yazid Attabi (1), Pierre Dumouchel (2)

(1) École de Technologie Supérieure, Canada
(2) CRIM, Canada

In this paper we present a new framework for emotion recognition from speech based on a similarity concept called Weighted Ordered Classes-Nearest Neighbors (WOC-NN). Unlike the k-nearest neighbor, an instance-similarity based method; WOC-NN computes similarities between a test instance and a class pattern of each emotion class. An emotion class pattern is a representation of its ranked neighboring classes. A Hamming distance is used as distance metric, enhanced with two improvements: i) weighting the importance of each class rank of each neighborhood pattern and ii) discarding irrelevant class ranks from the patterns. Thus, the decision process in WOC-NN exploits more information than Bayes rule which makes use only of the information about the model class that minimizes Bayes risk. This extra information allows WOC-NN to get more accurate prediction. Also, the results show that the proposed system outperforms the result of state-of-the art systems when applied to the FAU AIBO corpus.

Full Paper

Bibliographic reference.  Attabi, Yazid / Dumouchel, Pierre (2011): "Weighted ordered classes - nearest neighbors: a new framework for automatic emotion recognition from speech", In INTERSPEECH-2011, 3125-3128.