Regional Information Center for Science and Technology (RICEST) - VT-KFER: A Kinect-based RGBD+time dataset for spontaneous and non-spontaneous facial expression recognition

DocumentCode :

716157

Title :

VT-KFER: A Kinect-based RGBD+time dataset for spontaneous and non-spontaneous facial expression recognition

Author :

Aly, Sherin ; Trubanova, Andrea ; Abbott, Lynn ; White, Susan ; Youssef, Amira

Author_Institution :

Bradley Dept. of Electr. & Comput. Eng., Virginia Tech, Blacksburg, VA, USA

fYear :

2015

fDate :

19-22 May 2015

Firstpage :

Lastpage :

Abstract :

Human facial expressions have been extensively studied using 2D static images or 2D video sequences. The main limitations of 2D-based analysis are problems associated with large variations in pose and illumination. Therefore, an alternative is to utilize depth information, captured from 3D sensors, which is both pose and illumination invariant. The Kinect sensor is an inexpensive, portable, and fast way to capture the depth information. However, only a few researchers have utilized the Kinect sensor for the automatic recognition of facial expressions. This is partly due to the lack of a Kinect-based publicly available RGBD facial expression recognition (FER) dataset that contains the relevant facial expressions and their associated semantic labels. This paper addresses this problem by presenting the first publicly available RGBD+time facial expression recognition dataset using the Kinect 1.0 sensor in both scripted (acted) and unscripted (spontaneous) scenarios. Our fully annotated dataset includes seven expressions (happiness, sadness, surprise, disgust, fear, anger, and neutral) for 32 subjects (males and females) aged from 10 to 30 and with different skin tones. Both human and machine evaluations were conducted. Each scripted expression was ranked quantitatively by two research assistants in the Psychology department. Baseline machine evaluation resulted in average recognition accuracy levels of 60% and 58.3% for 6-expression and 7-expression recognition, respectively, when features from 2D and 3D data were combined.

Keywords :

face recognition; image colour analysis; image sensors; 2D static images; 2D video sequences; 3D sensors; Kinect 1.0 sensor; Kinect-based RGBD facial expression recognition; Kinect-based RGBD+time dataset; VT-KFER; depth information; nonspontaneous facial expression recognition; spontaneous facial expression recognition; Databases; Face; Face recognition; Games; Lighting; Sensors; Three-dimensional displays;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Biometrics (ICB), 2015 International Conference on

Conference_Location :

Phuket

Type :

conf

DOI :

10.1109/ICB.2015.7139081

Filename :

7139081

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=716157