DocumentCode :
3707324
Title :
Multi-view descriptor mining via codeword net for action recognition
Author :
Jingyu Liu;Yongzhen Huang;Xiaojiang Peng;Liang Wang
Author_Institution :
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
fYear :
2015
Firstpage :
793
Lastpage :
797
Abstract :
Action recognition is an important yet challenging task in computer vision. A successful and widely used framework in this field is the Bag of Visual Words (BoVW), wherein the first step is to extract local features. One critical property of local features is that they are often multi-view, e.g., dense trajectory feature includes both appearance and motion properties. Different types of features are aligned together in coding and pooling thus leading the process to be heavily entangled. Our motivation is to disentangle each sub-descriptor and let them contribute to the maximum extent. To achieve this, a codeword net is constructed via exploiting the relation between features and codewords. Based on the codeword net, features from the same viewpoint are pooled together. Experiments on two large scale action recognition datasets, UCF50 and HMDB51, demonstrate that our approach can enhance the state-of-the-art algorithms.
Keywords :
"Encoding","Computer vision","Feature extraction","Pattern recognition","Joining processes","Visualization","Trajectory"
Publisher :
ieee
Conference_Titel :
Image Processing (ICIP), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/ICIP.2015.7350908
Filename :
7350908
Link To Document :
بازگشت