Action recognition by Huffman coding and implicit action model

Author

Nijun Li ; Tongchi Zhou ; Lin Zhou ; Zhenyang Wu

Author_Institution

Sch. of Inf. Sci. & Eng., Southeast Univ., Nanjing, China

fYear

2015

fDate

12-14 June 2015

Firstpage

1

Lastpage

6

Abstract

Human action recognition is at the core of computer vision, and has great application value in intelligent human-computer interactions. On the basis of Bag-of-Words (BoW), this work presents a Huffman coding and Implicit Action Model (IAM) combined framework for action recognition. Specifically, Huffman coding, which outperforms naïve Bayesian method, is a robust estimation of visual words´ conditional probabilities; whereas IAM captures the spatio-temporal relationships of local features and outperforms most other common machine learning methods. Spatio-Temporal Interest Points (STIPs) and Harris corners are employed as local features, and multichannel feature description is adopted to exploit the complementarity among different features. Experiments on UCF-YouTube and HOHA2 datasets systematically compare the performance of various feature channels and machine learning methods, demonstrating the effectiveness of the approaches proposed by this paper. Finally, multiple augment mechanisms such as feature fusion, hierarchical codebooks and sparse coding are integrated into the recognition system, achieving the best ever performance comparing with the state-of-the-art.

Keywords

Bayes methods; Huffman codes; computer vision; human computer interaction; learning (artificial intelligence); Bag-of-Words; BoW; Harris corners; Huffman coding; IAM; STIP; computer vision; human action recognition; implicit action model; intelligent human-computer interactions; local features; machine learning methods; multichannel feature description; naïve Bayesian method; spatio temporal relationships; spatio-temporal interest points; visual words conditional probabilities; Accuracy; Feature extraction; Huffman coding; Learning systems; Niobium; Visualization; Huffman coding; action recognition; feature fusion; hierarchical codebooks; implicit action model (IAM); sparse coding;

fLanguage

English

Publisher

ieee

Conference_Titel

Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA), 2015 IEEE International Conference on

Conference_Location

Shenzhen

Type

conf

DOI

10.1109/CIVEMSA.2015.7158603

Filename

7158603