Action Recognition Using Nonnegative Action Component Representation and Sparse Basis Selection

Author

Haoran Wang ; Chunfeng Yuan ; Weiming Hu ; Haibin Ling ; Wankou Yang ; Changyin Sun

Author_Institution

Sch. of Autom., Southeast Univ., Nanjing, China

Volume

23

Issue

2

fYear

2014

fDate

Feb. 2014

Firstpage

570

Lastpage

581

Abstract

In this paper, we propose using high-level action units to represent human actions in videos and, based on such units, a novel sparse model is developed for human action recognition. There are three interconnected components in our approach. First, we propose a new context-aware spatial-temporal descriptor, named locally weighted word context, to improve the discriminability of the traditionally used local spatial-temporal descriptors. Second, from the statistics of the context-aware descriptors, we learn action units using the graph regularized nonnegative matrix factorization, which leads to a part-based representation and encodes the geometrical information. These units effectively bridge the semantic gap in action recognition. Third, we propose a sparse model based on a joint l2,1-norm to preserve the representative items and suppress noise in the action units. Intuitively, when learning the dictionary for action representation, the sparse model captures the fact that actions from the same class share similar units. The proposed approach is evaluated on several publicly available data sets. The experimental results and analysis clearly demonstrate the effectiveness of the proposed approach.

Keywords

graph theory; image coding; image recognition; image representation; matrix decomposition; noise abatement; statistical analysis; context-aware spatial-temporal descriptor; geometrical information encoding; graph regularized nonnegative matrix factorization; human action recognition; human action representation; interconnected component; joint l_2,1-norm sparse model; locally weighted word context; noise suppression; nonnegative action component representation; semantic gap; sparse basis selection; Context; Dictionaries; Feature extraction; Semantics; Vectors; Videos; Visualization; Action unit; action recognition; nonnegative matrix factorization; sparse representation;

fLanguage

English

Journal_Title

Image Processing, IEEE Transactions on

Publisher

ieee

ISSN

1057-7149

Type

jour

DOI

10.1109/TIP.2013.2292550

Filename

6675065