• DocumentCode
    33911
  • Title

    CSMMI: Class-Specific Maximization of Mutual Information for Action and Gesture Recognition

  • Author

    Jun Wan ; Athitsos, V. ; Jangyodsuk, P. ; Escalante, Hugo Jair ; Qiuqi Ruan ; Guyon, Isabelle

  • Author_Institution
    Inst. of Inf. Sci., Beijing Jiaotong Univ., Beijing, China
  • Volume
    23
  • Issue
    7
  • fYear
    2014
  • fDate
    Jul-14
  • Firstpage
    3152
  • Lastpage
    3165
  • Abstract
    In this paper, we propose a novel approach called class-specific maximization of mutual information (CSMMI) using a submodular method, which aims at learning a compact and discriminative dictionary for each class. Unlike traditional dictionary-based algorithms, which typically learn a shared dictionary for all of the classes, we unify the intraclass and interclass mutual information (MI) into an single objective function to optimize class-specific dictionary. The objective function has two aims: 1) maximizing the MI between dictionary items within a specific class (intrinsic structure) and 2) minimizing the MI between the dictionary items in a given class and those of the other classes (extrinsic structure). We significantly reduce the computational complexity of CSMMI by introducing an novel submodular method, which is one of the important contributions of this paper. This paper also contributes a state-of-the-art end-to-end system for action and gesture recognition incorporating CSMMI, with feature extraction, learning initial dictionary per each class by sparse coding, CSMMI via submodularity, and classification based on reconstruction errors. We performed extensive experiments on synthetic data and eight benchmark data sets. Our experimental results show that CSMMI outperforms shared dictionary methods and that our end-to-end system is competitive with other state-of-the-art approaches.
  • Keywords
    computational complexity; feature extraction; gesture recognition; CSMMI; action recognition; class-specific maximization; computational complexity; dictionary-based algorithm; discriminative dictionary; feature extraction; gesture recognition; interclass mutual information; intraclass mutual information; reconstruction errors; sparse coding; submodular method; Complexity theory; Dictionaries; Entropy; Feature extraction; Gesture recognition; Histograms; Linear programming; Gaussian Process; Intra-class mutual information; action recognition; class-specific dictionary; dictionary learning; gesture recognition; inter-class mutual information; sparse coding;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2014.2328181
  • Filename
    6824780