Title :
Learning directional co-occurrence for human action classification
Author :
Hong Liu ; Mengyuan Liu ; Qianru Sun
Author_Institution :
Key Lab. of Machine Perception, Peking Univ., Beijing, China
Abstract :
Spatio-temporal interest point (STIP) based methods have shown promising results for human action classification. However, state-of-art works typically utilize bag-of-visual words (BoVW), which focuses on the statistical distribution of features but ignores their inherent structural relationships. To solve this problem, a descriptor, namely directional pair-wise feature (DPF), is proposed to encode the mutual direction information between pairwise words, aiming at adding more spatial discriminant to BoVW. Firstly, STIP features are extracted and classified into a set of labeled words. Then in each frame, the DPF is constructed for every pair of words with different labels, according to their assigned directional vector. Finally, DPFs are quantized to be a probability histogram as a representation of human action. The proposed method is evaluated on two challenging datasets, Rochester and UT-interaction, and the results based on chi-squared kernel SVM classifiers confirm that our method can classify human actions with high accuracies.
Keywords :
feature extraction; gesture recognition; image classification; image motion analysis; image representation; learning (artificial intelligence); quantisation (signal); statistical distributions; support vector machines; video signal processing; BoVW; DPF descriptor; Rochester datasets; STIP based methods; STIP feature classification; STIP feature extraction; UT-interaction datasets; bag-of-visual words; chi-squared kernel SVM classifiers; directional cooccurrence learning; directional pair-wise feature descriptor; directional vector; human action classification; human action representation; labeled words; mutual direction information; pairwise words; probability histogram; quantization; spatial discriminant; spatio-temporal interest point; statistical distribution; Clustering methods; Feature extraction; Histograms; Streaming media; Sun; Visualization; Spatio-temporal interest point; bag-of-visual words; co-occurrence;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853794