DocumentCode :
3672171
Title :
Bilinear heterogeneous information machine for RGB-D action recognition
Author :
Yu Kong;Yun Fu
Author_Institution :
Department of Electrical and Computer Engineering, Northeastern University, Boston, MA, USA
fYear :
2015
fDate :
6/1/2015 12:00:00 AM
Firstpage :
1054
Lastpage :
1062
Abstract :
This paper proposes a novel approach to action recognition from RGB-D cameras, in which depth features and RGB visual features are jointly used. Rich heterogeneous RGB and depth data are effectively compressed and projected to a learned shared space, in order to reduce noise and capture useful information for recognition. Knowledge from various sources can then be shared with others in the learned space to learn cross-modal features. This guides the discovery of valuable information for recognition. To capture complex spatiotemporal structural relationships in visual and depth features, we represent both RGB and depth data in a matrix form. We formulate the recognition task as a low-rank bilinear model composed of row and column parameter matrices. The rank of the model parameter is minimized to build a low-rank classifier, which is beneficial for improving the generalization power. The proposed method is extensively evaluated on two public RGB-D action datasets, and achieves state-of-the-art results. It also shows promising results if RGB or depth data are missing in training or testing procedure.
Keywords :
"Computational modeling","Spatiotemporal phenomena","Visualization","Noise","Feature extraction","Training","Cameras"
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2015.7298708
Filename :
7298708
Link To Document :
بازگشت