12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Automatic Detection of Depression in Speech Using Gaussian Mixture Modeling with Factor Analysis

Douglas Sturim, Pedro A. Torres-Carrasquillo, Thomas F. Quatieri, Nicolas Malyska, Alan McCree

MIT Lincoln Laboratory, USA

Of increasing importance in the civilian and military population is the recognition of Major Depressive Disorder at its earliest stages and intervention before the onset of severe symptoms. Toward the goal of more effective monitoring of depression severity, we investigate automatic classifiers of depression state, that have the important property of mitigating nuisances due to data variability, such as speaker and channel effects, unrelated to levels of depression. To assess our measures, we use a 35-speaker free-response speech database of subjects treated for depression over a six-week duration, along with standard clinical HAMD depression ratings. Preliminary experiments indicate that by mitigating nuisances, thus focusing on depression severity as a class, we can significantly improve classification accuracy over baseline Gaussian-mixturemodel- based classifiers.

Full Paper

Bibliographic reference.  Sturim, Douglas / Torres-Carrasquillo, Pedro A. / Quatieri, Thomas F. / Malyska, Nicolas / McCree, Alan (2011): "Automatic detection of depression in speech using Gaussian mixture modeling with factor analysis", In INTERSPEECH-2011, 2981-2984.