DocumentCode :
1758874
Title :
Action Recognition From Video Using Feature Covariance Matrices
Author :
Kai Guo ; Ishwar, Prakash ; Konrad, Janusz
Author_Institution :
Dept. of Electr. & Comput. Eng., Boston Univ., Boston, MA, USA
Volume :
22
Issue :
6
fYear :
2013
fDate :
41426
Firstpage :
2479
Lastpage :
2494
Abstract :
We propose a general framework for fast and accurate recognition of actions in video using empirical covariance matrices of features. A dense set of spatio-temporal feature vectors are computed from video to provide a localized description of the action, and subsequently aggregated in an empirical covariance matrix to compactly represent the action. Two supervised learning methods for action recognition are developed using feature covariance matrices. Common to both methods is the transformation of the classification problem in the closed convex cone of covariance matrices into an equivalent problem in the vector space of symmetric matrices via the matrix logarithm. The first method applies nearest-neighbor classification using a suitable Riemannian metric for covariance matrices. The second method approximates the logarithm of a query covariance matrix by a sparse linear combination of the logarithms of training covariance matrices. The action label is then determined from the sparse coefficients. Both methods achieve state-of-the-art classification performance on several datasets, and are robust to action variability, viewpoint changes, and low object resolution. The proposed framework is conceptually simple and has low storage and computational requirements making it attractive for real-time implementation.
Keywords :
covariance matrices; image recognition; video cameras; video signal processing; Riemannian metric; classification problem transformation; closed convex cone; feature covariance matrices; query covariance matrix; spatio-temporal feature vectors; supervised learning methods; training covariance matrices; video action recognition; Covariance matrix; Linear approximation; Measurement; Sparse matrices; Symmetric matrices; Training; Vectors; Action recognition; Riemannian metric; feature covariance matrix; nearest-neighbor (NN) classifier; optical flow; silhouette tunnel; sparse linear approximation (SLA); video analysis;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2013.2252622
Filename :
6479703
Link To Document :
بازگشت