DocumentCode
698136
Title
Content based clinical depression detection in adolescents
Author
Low, Lu-Shih Alex ; Maddage, Namunu C. ; Lech, Margaret ; Sheeber, Lisa ; Allen, Nicholas
Author_Institution
Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia
fYear
2009
fDate
24-28 Aug. 2009
Firstpage
2362
Lastpage
2366
Abstract
This paper studies the effectiveness of speech contents for detecting clinical depression in adolescents. We also evaluated the performances of acoustic features such as Mel frequency cepstral coefficients (MFCC), short time energy (Energy), zero crossing rate (ZCR) and Teager energy operator (TEO) using Gaussian mixture models for depression detection. A clinical data set of speech from 139 adolescents, including 68 (49 girls and 19 boys) diagnosed as clinically depressed, was used in the classification experiments. Each subject participated in three 20 minutes interactions. The classification was first performed using the whole data and a smaller sub-set of data selected based on behavioural constructs defined by trained human observers (data with constructs). In the experiments, we found that the MFCC+Energy feature out performed the TEO feature. The results indicated that using the construct based speech contents in the problem solving interactions (PSI) session improved the detection accuracy. Accuracy was further improved by 4% when the gender dependent depression modelling technique was adopted. By using construct based PSI session speech content, gender based depression models achieved 65.1% average detection accuracy. Also, for both types of features (TEO and MFCC), the correct classification rates were higher for female speakers than for male speakers.
Keywords
gender issues; psychology; speech recognition; Gaussian mixture models; Mel frequency cepstral coefficients; Teager energy operator; acoustic features; adolescents; construct based PSI session speech content; construct based speech contents; content based clinical depression detection; gender based depression models; gender dependent depression modelling technique; problem solving interactions; short time energy; speech clinical data set; types of features; zero crossing rate; Accuracy; Feature extraction; Filter banks; Mel frequency cepstral coefficient; Psychology; Speech; Speech processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2009 17th European
Conference_Location
Glasgow
Print_ISBN
978-161-7388-76-7
Type
conf
Filename
7077711
Link To Document