INTERSPEECH 2013
14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Characterising Depressed Speech for Classification

Sharifa Alghowinem (1), Roland Goecke (2), Michael Wagner (2), Julien Epps (3), Gordon Parker (3), Michael Breakspear (4)

(1) Australian National University, Australia
(2) University of Canberra, Australia
(3) University of New South Wales, Australia
(4) Queensland Institute of Medical Research, Australia

Depression is a serious psychiatric disorder that affects mood, thoughts, and the ability to function in everyday life. This paper investigates the characteristics of depressed speech for the purpose of automatic classification by analysing the effect of different speech features on the classification results. We analysed voiced, unvoiced and mixed speech in order to gain a better understanding of depressed speech and to bridge the gap between physiological and affective computing studies. This understanding may ultimately lead to an objective affective sensing system that supports clinicians in their diagnosis and monitoring of clinical depression. The characteristics of depressed speech were statistically analysed using ANOVA and linked to their classification results using GMM and SVM. Features were extracted and classified over speech utterances of 30 clinically depressed patients against 30 controls (both gender-matched) in a speaker-independent manner. Most feature classification results were consistent with their statistical characteristics, providing a link between physiological and affective computing studies. The classification results from low-level features were slightly better than the statistical functional features, which indicates a loss of information in the latter. We found that both mixed and unvoiced speech were as useful in detecting depression as voiced speech, if not better.

Full Paper

Bibliographic reference.  Alghowinem, Sharifa / Goecke, Roland / Wagner, Michael / Epps, Julien / Parker, Gordon / Breakspear, Michael (2013): "Characterising depressed speech for classification", In INTERSPEECH-2013, 2534-2538.