• DocumentCode
    722841
  • Title

    Action recognition by Huffman coding and implicit action model

  • Author

    Nijun Li ; Tongchi Zhou ; Lin Zhou ; Zhenyang Wu

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Southeast Univ., Nanjing, China
  • fYear
    2015
  • fDate
    12-14 June 2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Human action recognition is at the core of computer vision, and has great application value in intelligent human-computer interactions. On the basis of Bag-of-Words (BoW), this work presents a Huffman coding and Implicit Action Model (IAM) combined framework for action recognition. Specifically, Huffman coding, which outperforms naïve Bayesian method, is a robust estimation of visual words´ conditional probabilities; whereas IAM captures the spatio-temporal relationships of local features and outperforms most other common machine learning methods. Spatio-Temporal Interest Points (STIPs) and Harris corners are employed as local features, and multichannel feature description is adopted to exploit the complementarity among different features. Experiments on UCF-YouTube and HOHA2 datasets systematically compare the performance of various feature channels and machine learning methods, demonstrating the effectiveness of the approaches proposed by this paper. Finally, multiple augment mechanisms such as feature fusion, hierarchical codebooks and sparse coding are integrated into the recognition system, achieving the best ever performance comparing with the state-of-the-art.
  • Keywords
    Bayes methods; Huffman codes; computer vision; human computer interaction; learning (artificial intelligence); Bag-of-Words; BoW; Harris corners; Huffman coding; IAM; STIP; computer vision; human action recognition; implicit action model; intelligent human-computer interactions; local features; machine learning methods; multichannel feature description; naïve Bayesian method; spatio temporal relationships; spatio-temporal interest points; visual words conditional probabilities; Accuracy; Feature extraction; Huffman coding; Learning systems; Niobium; Visualization; Huffman coding; action recognition; feature fusion; hierarchical codebooks; implicit action model (IAM); sparse coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA), 2015 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Type

    conf

  • DOI
    10.1109/CIVEMSA.2015.7158603
  • Filename
    7158603