مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-view descriptor mining via codeword net for action recognition

DocumentCode :

3707324

Title :

Multi-view descriptor mining via codeword net for action recognition

Author :

Jingyu Liu;Yongzhen Huang;Xiaojiang Peng;Liang Wang

Author_Institution :

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

fYear :

2015

Firstpage :

793

Lastpage :

797

Abstract :

Action recognition is an important yet challenging task in computer vision. A successful and widely used framework in this field is the Bag of Visual Words (BoVW), wherein the first step is to extract local features. One critical property of local features is that they are often multi-view, e.g., dense trajectory feature includes both appearance and motion properties. Different types of features are aligned together in coding and pooling thus leading the process to be heavily entangled. Our motivation is to disentangle each sub-descriptor and let them contribute to the maximum extent. To achieve this, a codeword net is constructed via exploiting the relation between features and codewords. Based on the codeword net, features from the same viewpoint are pooled together. Experiments on two large scale action recognition datasets, UCF50 and HMDB51, demonstrate that our approach can enhance the state-of-the-art algorithms.

Keywords :

"Encoding","Computer vision","Feature extraction","Pattern recognition","Joining processes","Visualization","Trajectory"

Publisher :

ieee

Conference_Titel :

Image Processing (ICIP), 2015 IEEE International Conference on

Type :

conf

DOI :

10.1109/ICIP.2015.7350908

Filename :

7350908

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3707324