This paper analyzes how the voice-quality features of "anger", "joy", and "sadness" in Japanese speech vary with the degree of the emotion. The degrees of emotion were "neutral", "light", "medium", and "strong". Among voice-quality features, we focused on the noise level of the glottal-flow waveform. We adopted an autoregressive (AR) model and measured the noise level of the prediction residual signal of speech expressing each emotion. To express the noise level relative to the signal level, we introduced the "noise-to-signal (N/S) ratio". The analysis showed that the relative noise levels in the residual-waveform spectra differed among the emotions: the N/S ratio was larger in the order "anger" > "sadness", "neutral" > "joy", by approximately 4 dB.
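The measurement chain described above (AR modelling, prediction residual, relative noise level) can be illustrated with a minimal sketch. It is not the authors' procedure: the abstract does not give the exact definition of the N/S ratio, so the sketch assumes, purely for illustration, that it is the residual energy divided by the signal energy, expressed in dB. The function names, the model order of 12, and the Hamming window are likewise assumptions.

```python
import numpy as np


def lpc_coefficients(frame, order):
    """Estimate AR coefficients a[1..p] by the autocorrelation method,
    so that x[n] is approximated by sum_k a[k] * x[n - k]."""
    w = frame * np.hamming(len(frame))
    # Autocorrelation at lags 0..order
    r = np.correlate(w, w, mode="full")[len(w) - 1 : len(w) + order]
    # Toeplitz normal equations R a = r[1:]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * r[0] * np.eye(order)  # small regularisation for stability
    return np.linalg.solve(R, r[1 : order + 1])


def prediction_residual(frame, a):
    """Inverse-filter the frame: e[n] = x[n] - sum_k a[k] * x[n - k]."""
    inverse_filter = np.concatenate(([1.0], -np.asarray(a, dtype=float)))
    return np.convolve(frame, inverse_filter)[: len(frame)]


def ns_ratio_db(frame, order=12):
    """ASSUMED definition: residual energy over signal energy, in dB.
    The paper's actual N/S ratio may be defined differently."""
    a = lpc_coefficients(frame, order)
    e = prediction_residual(frame, a)
    return 10.0 * np.log10(np.sum(e**2) / np.sum(frame**2))
```

Under this assumed definition, a frame that the AR model predicts well (for example, a clean periodic signal) gives a strongly negative value, while a noise-like frame gives a value near 0 dB, so a larger value indicates a relatively noisier residual.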
Bibliographic reference. Takeda, Shoichi / Yasuda, Yuuri / Isobe, Risako / Kiryu, Shogo / Tsuru, Makiko (2008): "Analysis of voice-quality features of speech that expresses "anger", "joy", and "sadness" uttered by radio actors and actresses", In INTERSPEECH-2008, 2114-2117.