Title :
Spatiotemporal Local Monogenic Binary Patterns for Facial Expression Recognition
Author :
Huang, Xiaohua ; Zhao, Guoying ; Zheng, Wenming ; Pietikäinen, Matti
Author_Institution :
Key Lab. of Child Dev. & Learning Sci., Southeast Univ., Nanjing, China
fDate :
5/1/2012 12:00:00 AM
Abstract :
Feature representation is an important research topic in facial expression recognition from video sequences. In this letter, we propose to use spatiotemporal monogenic binary patterns to describe both appearance and motion information of the dynamic sequences. Firstly, we use monogenic signals analysis to extract the magnitude, the real picture and the imaginary picture of the orientation of each frame, since the magnitude can provide much appearance information and the orientation can provide complementary information. Secondly, the phase-quadrant encoding method and the local bit exclusive operator are utilized to encode the real and imaginary pictures from orientation in three orthogonal planes, and the local binary pattern operator is used to capture the texture and motion information from the magnitude through three orthogonal planes. Finally, both concatenation method and multiple kernel learning method are respectively exploited to handle the feature fusion. The experimental results on the Extended Cohn-Kanade and Oulu-CASIA facial expression databases demonstrate that the proposed methods perform better than the state-of-the-art methods, and are robust to illumination variations.
Keywords :
encoding; face recognition; image representation; image sequences; image texture; video coding; Oulu-CASIA facial expression databases; concatenation method; extended Cohn-Kanade facial expression databases; facial expression recognition; feature fusion; feature representation; imaginary pictures; kernel learning method; local binary pattern operator; monogenic signals analysis; motion information; orthogonal planes; phase-quadrant encoding method; spatiotemporal local monogenic binary patterns; texture information; video sequences; Encoding; Face; Facial features; Histograms; Kernel; Spatiotemporal phenomena; Video sequences; Magnitude; monogenic filter; multiple kernel learning; orientation;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2012.2188890