• DocumentCode
    1693903
  • Title

    A comparative study of different classifiers for detecting depression from spontaneous speech

  • Author

    Alghowinem, Sharifa ; Goecke, Roland ; Wagner, Michael ; Epps, Julien ; Gedeon, Tom ; Breakspear, Michael ; Parker, Gordon

  • Author_Institution
    Australian Nat. Univ., Canberra, ACT, Australia
  • fYear
    2013
  • Firstpage
    8022
  • Lastpage
    8026
  • Abstract
    Accurate detection of depression from spontaneous speech could lead to an objective diagnostic aid to assist clinicians to better diagnose depression. Little thought has been given so far to which classifier performs best for this task. In this study, using a 60-subject real-world clinically validated dataset, we compare three popular classifiers from the affective computing literature - Gaussian Mixture Models (GMM), Support Vector Machines (SVM) and Multilayer Perceptron neural networks (MLP) - as well as the recently proposed Hierarchical Fuzzy Signature (HFS) classifier. Among these, a hybrid classifier using GMM models and SVM gave the best overall classification results. Comparing feature, score, and decision fusion, score fusion performed better for GMM, HFS and MLP, while decision fusion worked best for SVM (both for raw data and GMM models). Feature fusion performed worse than other fusion methods in this study. We found that loudness, root mean square, and intensity were the voice features that performed best to detect depression in this dataset.
  • Keywords
    Gaussian processes; fuzzy set theory; mean square error methods; medical disorders; medical signal processing; multilayer perceptrons; patient diagnosis; signal classification; speech processing; support vector machines; GMM models; Gaussian mixture models; HFS classifier; MLP; SVM; affective computing; clinician assistance; decision fusion; depression detection; depression diagnosis; hierarchical fuzzy signature classifier; hybrid classifier; loudness; multilayer perceptron neural networks; objective diagnostic aid; root mean square; score fusion; spontaneous speech; support vector machines; voice feature intensity; Australia; Emotion recognition; Feature extraction; Hidden Markov models; Speech; Speech recognition; Support vector machines; Mood detection; affective sensing; classifier comparison; clinical depression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639227
  • Filename
    6639227