ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Modeling spectral variability for the classification of depressed speech

Nicholas Cummins, Julien Epps, Vidhyasaharan Sethu, Michael Breakspear, Roland Goecke

Quantifying how the spectral content of speech relates to changes in mental state may be crucial in building an objective speechbased depression classification system with clinical utility. This paper investigates the hypothesis that important depression based information can be captured within the covariance structure of a Gaussian Mixture Model (GMM) of recorded speech. Significant negative correlations found between a speaker's average weighted variance . a GMM-based indicator of speaker variability . and their level of depression support this hypothesis. Further evidence is provided by the comparison of classification accuracies from seven different GMM-UBM systems, each formed by varying different parameter combinations during MAP adaption. This analysis shows that variance-only adaptation either outperforms or matches the de facto standard mean-only adaptation when classifying both the presence and severity of depression. This result is perhaps the first of its kind seen in GMM-UBM speech classification.


doi: 10.21437/Interspeech.2013-242

Cite as: Cummins, N., Epps, J., Sethu, V., Breakspear, M., Goecke, R. (2013) Modeling spectral variability for the classification of depressed speech. Proc. Interspeech 2013, 857-861, doi: 10.21437/Interspeech.2013-242

@inproceedings{cummins13_interspeech,
  author={Nicholas Cummins and Julien Epps and Vidhyasaharan Sethu and Michael Breakspear and Roland Goecke},
  title={{Modeling spectral variability for the classification of depressed speech}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={857--861},
  doi={10.21437/Interspeech.2013-242}
}