Title : 
VT-KFER: A Kinect-based RGBD+time dataset for spontaneous and non-spontaneous facial expression recognition
         
        
            Author : 
Aly, Sherin ; Trubanova, Andrea ; Abbott, Lynn ; White, Susan ; Youssef, Amira
         
        
            Author_Institution : 
Bradley Dept. of Electr. & Comput. Eng., Virginia Tech, Blacksburg, VA, USA
         
        
        
        
        
        
            Abstract : 
Human facial expressions have been extensively studied using 2D static images or 2D video sequences. The main limitations of 2D-based analysis are problems associated with large variations in pose and illumination. Therefore, an alternative is to utilize depth information, captured from 3D sensors, which is both pose and illumination invariant. The Kinect sensor is an inexpensive, portable, and fast way to capture the depth information. However, only a few researchers have utilized the Kinect sensor for the automatic recognition of facial expressions. This is partly due to the lack of a Kinect-based publicly available RGBD facial expression recognition (FER) dataset that contains the relevant facial expressions and their associated semantic labels. This paper addresses this problem by presenting the first publicly available RGBD+time facial expression recognition dataset using the Kinect 1.0 sensor in both scripted (acted) and unscripted (spontaneous) scenarios. Our fully annotated dataset includes seven expressions (happiness, sadness, surprise, disgust, fear, anger, and neutral) for 32 subjects (males and females) aged from 10 to 30 and with different skin tones. Both human and machine evaluation were conducted. Each scripted expression was ranked quantitatively by two research assistants in the Psychology department. Baseline machine evaluation resulted in average recognition accuracy levels of 60% and 58.3% for 6 expressions and 7 expressions recognition, respectively, when features from 2D and 3D data were combined.
         
        
            Keywords : 
face recognition; image colour analysis; image sensors; 2D static images; 2D video sequences; 3D sensors; Kinect 1.0 sensor; Kinect-based RGBD facial expression recognition; Kinect-based RGBD+time dataset; VT-KFER; depth information; nonspontaneous facial expression recognition; spontaneous facial expression recognition; Databases; Face; Face recognition; Games; Lighting; Sensors; Three-dimensional displays;
         
        
        
        
            Conference_Titel : 
Biometrics (ICB), 2015 International Conference on
         
        
            Conference_Location : 
Phuket
         
        
        
            DOI : 
10.1109/ICB.2015.7139081