مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi task sequence learning for depression scale prediction from video

DocumentCode :

3703363

Title :

Multi task sequence learning for depression scale prediction from video

Author :

Linlin Chao;Jianhua Tao;Minghao Yang;Ya Li

Author_Institution :

National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences, Beijing, China

fYear :

2015

Firstpage :

526

Lastpage :

531

Abstract :

Depression is a typical mood disorder, which affects people in mental and even physical problems. People who suffer depression always behave abnormal in visual behavior and the voice. In this paper, an audio visual based multimodal depression scale prediction system is proposed. Firstly, features are extracted from video and audio are fused in feature level to represent the audio visual behavior. Secondly, long short memory recurrent neural network (LSTM-RNN) is utilized to encode the dynamic temporal information of the abnormal audio visual behavior. Thirdly, emotion information is utilized by multi-task learning to boost the performance further. The proposed approach is evaluated on the Audio-Visual Emotion Challenge (AVEC2014) dataset. Experiments results show the dimensional emotion recognition helps to depression scale prediction.

Keywords :

"Feature extraction","Visualization","Face","Shape","Training","Emotion recognition","Context"

Publisher :

ieee

Conference_Titel :

Affective Computing and Intelligent Interaction (ACII), 2015 International Conference on

Electronic_ISBN :

2156-8111

Type :

conf

DOI :

10.1109/ACII.2015.7344620

Filename :

7344620

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3703363