9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Study on Strained Rough Voice as a Conveyer of Rage

Yumiko O. Kato, Yoshifumi Hirose, Takahiro Kamai

Matsushita Electric Industrial Co. Ltd., Japan

It is important to be able to determine anger and its degree for dialog management in an interactive speech interface. We investigated the characteristics of a strained rough voice as a conveyer of a speaker's degree of anger. In hot anger speech in Japanese, a rough voice with high glottal tension is observed frequently, and the rate of occurrence of the strained rough voice increases according to the degree of anger. In a typical male speaker's speech sample, amplitude fluctuations observed in a strained rough voice were periodic; the frequency was around between 40.80 Hz. The modulation ratio in rage speech was larger than in other emotional states, suggesting the possibility of determining the speaker's anger and its degree by detecting strained rough voice.

Full Paper

Bibliographic reference.  Kato, Yumiko O. / Hirose, Yoshifumi / Kamai, Takahiro (2008): "Study on strained rough voice as a conveyer of rage", In INTERSPEECH-2008, 249-252.