ISCA Archive Interspeech 2008

Speaker-independent emotion recognition based on feature vector classification

Jeong-Sik Park, Ji-Hwan Kim, Sang-Min Yoon, Yung-Hwan Oh

This paper proposes a new feature vector classification for speech emotion recognition. The conventional feature vector classification, as applied to speaker identification, categorizes feature vectors as either overlapped or non-overlapped. It discards all overlapped vectors during model training and uses only the non-overlapped vectors to reconstruct the corresponding speaker models. Although this classification performs well in speaker identification, it struggles to build robust models when the number of overlapped vectors increases significantly, as it does in emotion recognition. To overcome this drawback, we propose a more sophisticated classification method that selects discriminative vectors from among the overlapped vectors and includes them in model reconstruction. In experiments on an LDC emotion corpus, our approach outperformed the conventional method.
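
The abstract does not give the criteria used to label vectors, so the following is only an illustrative sketch of the general idea, not the authors' procedure. It assumes one diagonal Gaussian per emotion class in place of whatever models the paper uses, treats a vector as non-overlapped when its own class model scores it highest, and treats an overlapped vector as discriminative when its own class trails the best rival by no more than a hypothetical tolerance `margin_tol`. All function names and the tolerance are assumptions made for illustration.

```python
import numpy as np

def fit_diag_gaussian(x):
    """Fit a diagonal Gaussian (assumed stand-in model) to the rows of x."""
    return x.mean(axis=0), x.var(axis=0) + 1e-6  # small floor avoids zero variance

def log_likelihood(x, model):
    """Per-vector log-likelihood of the rows of x under a diagonal Gaussian."""
    mean, var = model
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mean) ** 2 / var, axis=1)

def classify_vectors(x, own, models, margin_tol):
    """Label the vectors of class `own` (needs at least two class models).

    Assumed criteria, not taken from the paper:
      non-overlapped: the own-class model scores higher than every rival;
      discriminative: overlapped, but within margin_tol of the best rival.
    """
    scores = np.stack([log_likelihood(x, m) for m in models], axis=1)
    rival = np.delete(scores, own, axis=1).max(axis=1)
    margin = scores[:, own] - rival
    non_overlapped = margin > 0
    discriminative = (~non_overlapped) & (margin >= -margin_tol)
    return non_overlapped, discriminative

def reconstruct_models(data, margin_tol=1.0):
    """Refit each class model on its non-overlapped plus discriminative vectors."""
    models = [fit_diag_gaussian(x) for x in data]
    rebuilt = []
    for k, x in enumerate(data):
        non_overlapped, discriminative = classify_vectors(x, k, models, margin_tol)
        selected = x[non_overlapped | discriminative]
        # Fall back to the initial model if too few vectors survive selection.
        rebuilt.append(fit_diag_gaussian(selected) if len(selected) > 1 else models[k])
    return rebuilt
```

Under these assumptions, setting `margin_tol` to 0 keeps essentially only the non-overlapped vectors, which corresponds to the conventional scheme of discarding every overlapped vector, while larger values admit more overlapped vectors into model reconstruction.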

doi: 10.21437/Interspeech.2008-688

Cite as: Park, J.-S., Kim, J.-H., Yoon, S.-M., Oh, Y.-H. (2008) Speaker-independent emotion recognition based on feature vector classification. Proc. Interspeech 2008, 2775-2778, doi: 10.21437/Interspeech.2008-688

  author={Jeong-Sik Park and Ji-Hwan Kim and Sang-Min Yoon and Yung-Hwan Oh},
  title={{Speaker-independent emotion recognition based on feature vector classification}},
  booktitle={Proc. Interspeech 2008},