Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Study on Speaker Verification on Emotional Speech

Wei Wu, Thomas Fang Zheng, Ming-Xing Xu, Huan-Jun Bao

Tsinghua University, China

Besides background noise, channel effect and speakerís health condition, emotion is another factor which may influence the performance of a speaker verification system. In this paper, the performance of a GMM-UBM based speaker verification system on emotional speech is studied. It is found that speech with various emotions aggravates the verification performance. Two reasons for the performance aggravation are analyzed, they are mismatched emotions between the speaker models and the test utterances, and the articulating styles of certain emotions which create intense intra-speaker vocal variability. In response to the first reason, an emotion-dependent score normalization method is proposed, which is borrowed from the idea of Hnorm.

Full Paper

Bibliographic reference.  Wu, Wei / Zheng, Thomas Fang / Xu, Ming-Xing / Bao, Huan-Jun (2006): "Study on speaker verification on emotional speech", In INTERSPEECH-2006, paper 1124-Wed3CaP.7.