ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Speaker dependent emotion recognition using speech signals

Bong-Seok Kang, Chul-Hee Han, Sang-Tae Lee, Dae-Hee Youn, Chungyong Lee

This paper examines three algorithms to recognize speakerÂ’s emotion using the speech signals. Target emotions are happiness, sadness, anger, fear, boredom and neutral state. MLB (Maximum-Likelihood Bayes), NN (Nearest Neighbor) and HMM (Hidden Markov Model) algorithms are used as the pattern matching techniques. In all cases, pitch and energy are used as the features. The feature vectors for MLB and NN are composed of pitch mean, pitch standard deviation, energy mean, energy standard deviation, etc. For HMM, vectors of delta pitch with delta-delta pitch and delta energy with delta-delta energy are used. A corpus of emotional speech data was recorded and the subjective evaluation of the data was performed by 23 untrained listeners. The subjective recognition result was 56% and was compared with the classifiersÂ’ recognition rates. MLB, NN, and HMM classifiers achieved recognition rates of 68.9%, 69.3%, and 89.1%, respectively, for the speaker dependent and context-independent classification.


Cite as: Kang, B.-S., Han, C.-H., Lee, S.-T., Youn, D.-H., Lee, C. (2000) Speaker dependent emotion recognition using speech signals. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 383-386

@inproceedings{kang00_icslp,
  author={Bong-Seok Kang and Chul-Hee Han and Sang-Tae Lee and Dae-Hee Youn and Chungyong Lee},
  title={{Speaker dependent emotion recognition using speech signals}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 2, 383-386}
}