Title :
Action Recognition Using Nonnegative Action Component Representation and Sparse Basis Selection
Author :
Haoran Wang ; Chunfeng Yuan ; Weiming Hu ; Haibin Ling ; Wankou Yang ; Changyin Sun
Author_Institution :
Sch. of Autom., Southeast Univ., Nanjing, China
Abstract :
In this paper, we propose using high-level action units to represent human actions in videos and, based on such units, a novel sparse model is developed for human action recognition. There are three interconnected components in our approach. First, we propose a new context-aware spatial-temporal descriptor, named locally weighted word context, to improve the discriminability of the traditionally used local spatial-temporal descriptors. Second, from the statistics of the context-aware descriptors, we learn action units using the graph regularized nonnegative matrix factorization, which leads to a part-based representation and encodes the geometrical information. These units effectively bridge the semantic gap in action recognition. Third, we propose a sparse model based on a joint l2,1-norm to preserve the representative items and suppress noise in the action units. Intuitively, when learning the dictionary for action representation, the sparse model captures the fact that actions from the same class share similar units. The proposed approach is evaluated on several publicly available data sets. The experimental results and analysis clearly demonstrate the effectiveness of the proposed approach.
Keywords :
graph theory; image coding; image recognition; image representation; matrix decomposition; noise abatement; statistical analysis; context-aware spatial-temporal descriptor; geometrical information encoding; graph regularized nonnegative matrix factorization; human action recognition; human action representation; interconnected component; joint l2,1-norm sparse model; locally weighted word context; noise suppression; nonnegative action component representation; semantic gap; sparse basis selection; Context; Dictionaries; Feature extraction; Semantics; Vectors; Videos; Visualization; Action unit; action recognition; nonnegative matrix factorization; sparse representation;
Journal_Title :
Image Processing, IEEE Transactions on
DOI :
10.1109/TIP.2013.2292550