To obtain efficient and compact low-dimensional features for speech emotion recognition, this paper proposes a novel feature reduction method that uses multiscale kernels within the graph embedding framework. With a Fisher discriminant embedding graph, multiscale Gaussian kernels are used to construct an optimal linear combination of Gram matrices for multiple kernel learning. Comprehensive experiments, using different public feature sets from the open-source toolbox openSMILE on various corpora, show that the proposed method achieves better performance than conventional linear dimensionality reduction methods and single-kernel methods.
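As a rough illustration of the idea of combining Gram matrices from Gaussian kernels at several scales, the following minimal sketch builds one Gram matrix per bandwidth and forms a weighted sum. It is not the paper's algorithm: the function names, the bandwidth values, and the uniform default weights are assumptions made for illustration, whereas the paper learns an optimal combination under a Fisher discriminant embedding graph.

```python
import numpy as np


def gaussian_gram(X, sigma):
    """Gram matrix of the Gaussian kernel exp(-||x - y||^2 / (2 * sigma^2))."""
    sq_norms = np.sum(X ** 2, axis=1)
    # Pairwise squared Euclidean distances between rows of X.
    d2 = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (X @ X.T)
    # Clamp tiny negative values caused by floating-point round-off.
    d2 = np.maximum(d2, 0.0)
    return np.exp(-d2 / (2.0 * sigma ** 2))


def multiscale_gram(X, sigmas, weights=None):
    """Linear combination of Gaussian Gram matrices at several scales.

    `weights` is a hypothetical placeholder for the combination
    coefficients; uniform weights are used when none are given
    (an assumption, not the learned optimum described in the paper).
    """
    grams = np.stack([gaussian_gram(X, s) for s in sigmas])
    if weights is None:
        weights = np.full(len(sigmas), 1.0 / len(sigmas))
    weights = np.asarray(weights, dtype=float)
    # Contract the scale axis: sum_m weights[m] * grams[m].
    return np.tensordot(weights, grams, axes=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(6, 4))  # 6 samples, 4 features (toy data)
    K = multiscale_gram(X, sigmas=[0.5, 1.0, 2.0])
    print(K.shape)
```

With non-negative weights the combined matrix remains symmetric positive semidefinite, so it can be used wherever a single-kernel Gram matrix would be.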
Cite as: Xu, X., Deng, J., Zheng, W., Zhao, L., Schuller, B. (2015) Dimensionality reduction for speech emotion features by multiscale kernels. Proc. Interspeech 2015, 1532-1536, doi: 10.21437/Interspeech.2015-335
@inproceedings{xu15b_interspeech,
  author={Xinzhou Xu and Jun Deng and Wenming Zheng and Li Zhao and Björn Schuller},
  title={{Dimensionality reduction for speech emotion features by multiscale kernels}},
  year=2015,
  booktitle={Proc. Interspeech 2015},
  pages={1532--1536},
  doi={10.21437/Interspeech.2015-335}
}