DocumentCode
33911
Title
CSMMI: Class-Specific Maximization of Mutual Information for Action and Gesture Recognition
Author
Jun Wan ; Athitsos, V. ; Jangyodsuk, P. ; Escalante, Hugo Jair ; Qiuqi Ruan ; Guyon, Isabelle
Author_Institution
Inst. of Inf. Sci., Beijing Jiaotong Univ., Beijing, China
Volume
23
Issue
7
fYear
2014
fDate
Jul-14
Firstpage
3152
Lastpage
3165
Abstract
In this paper, we propose a novel approach called class-specific maximization of mutual information (CSMMI) using a submodular method, which aims at learning a compact and discriminative dictionary for each class. Unlike traditional dictionary-based algorithms, which typically learn a shared dictionary for all of the classes, we unify the intraclass and interclass mutual information (MI) into an single objective function to optimize class-specific dictionary. The objective function has two aims: 1) maximizing the MI between dictionary items within a specific class (intrinsic structure) and 2) minimizing the MI between the dictionary items in a given class and those of the other classes (extrinsic structure). We significantly reduce the computational complexity of CSMMI by introducing an novel submodular method, which is one of the important contributions of this paper. This paper also contributes a state-of-the-art end-to-end system for action and gesture recognition incorporating CSMMI, with feature extraction, learning initial dictionary per each class by sparse coding, CSMMI via submodularity, and classification based on reconstruction errors. We performed extensive experiments on synthetic data and eight benchmark data sets. Our experimental results show that CSMMI outperforms shared dictionary methods and that our end-to-end system is competitive with other state-of-the-art approaches.
Keywords
computational complexity; feature extraction; gesture recognition; CSMMI; action recognition; class-specific maximization; computational complexity; dictionary-based algorithm; discriminative dictionary; feature extraction; gesture recognition; interclass mutual information; intraclass mutual information; reconstruction errors; sparse coding; submodular method; Complexity theory; Dictionaries; Entropy; Feature extraction; Gesture recognition; Histograms; Linear programming; Gaussian Process; Intra-class mutual information; action recognition; class-specific dictionary; dictionary learning; gesture recognition; inter-class mutual information; sparse coding;
fLanguage
English
Journal_Title
Image Processing, IEEE Transactions on
Publisher
ieee
ISSN
1057-7149
Type
jour
DOI
10.1109/TIP.2014.2328181
Filename
6824780
Link To Document