• DocumentCode
    716157
  • Title

    VT-KFER: A Kinect-based RGBD+time dataset for spontaneous and non-spontaneous facial expression recognition

  • Author

    Aly, Sherin ; Trubanova, Andrea ; Abbott, Lynn ; White, Susan ; Youssef, Amira

  • Author_Institution
    Bradley Dept. of Electr. & Comput. Eng., Virginia Tech, Blacksburg, VA, USA
  • fYear
    2015
  • fDate
    19-22 May 2015
  • Firstpage
    90
  • Lastpage
    97
  • Abstract
    Human facial expressions have been extensively studied using 2D static images or 2D video sequences. The main limitations of 2D-based analysis are problems associated with large variations in pose and illumination. Therefore, an alternative is to utilize depth information, captured from 3D sensors, which is both pose and illumination invariant. The Kinect sensor is an inexpensive, portable, and fast way to capture the depth information. However, only a few researchers have utilized the Kinect sensor for the automatic recognition of facial expressions. This is partly due to the lack of a Kinect-based publicly available RGBD facial expression recognition (FER) dataset that contains the relevant facial expressions and their associated semantic labels. This paper addresses this problem by presenting the first publicly available RGBD+time facial expression recognition dataset using the Kinect 1.0 sensor in both scripted (acted) and unscripted (spontaneous) scenarios. Our fully annotated dataset includes seven expressions (happiness, sadness, surprise, disgust, fear, anger, and neutral) for 32 subjects (males and females) aged from 10 to 30 and with different skin tones. Both human and machine evaluation were conducted. Each scripted expression was ranked quantitatively by two research assistants in the Psychology department. Baseline machine evaluation resulted in average recognition accuracy levels of 60% and 58.3% for 6 expressions and 7 expressions recognition, respectively, when features from 2D and 3D data were combined.
  • Keywords
    face recognition; image colour analysis; image sensors; 2D static images; 2D video sequences; 3D sensors; Kinect 1.0 sensor; Kinect-based RGBD facial expression recognition; Kinect-based RGBD+time dataset; VT-KFER; depth information; nonspontaneous facial expression recognition; spontaneous facial expression recognition; Databases; Face; Face recognition; Games; Lighting; Sensors; Three-dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Biometrics (ICB), 2015 International Conference on
  • Conference_Location
    Phuket
  • Type

    conf

  • DOI
    10.1109/ICB.2015.7139081
  • Filename
    7139081