DocumentCode :
595555
Title :
Sparse shift-invariant representation of local 2D patterns and sequence learning for human action recognition
Author :
Baccouche, Moez ; Mamalet, F. ; Wolf, Christian ; Garcia, Christophe ; Baskurt, A.
Author_Institution :
Orange Labs. R&D, France
fYear :
2012
fDate :
11-15 Nov. 2012
Firstpage :
3823
Lastpage :
3826
Abstract :
Most existing methods for action recognition mainly rely on manually engineered features which, despite their good performances, are highly problem dependent. We propose in this paper a fully automated model, which learns to classify human actions without using any prior knowledge. A convolutional sparse autoencoder learns to extract sparse shift-invariant representations of the 2D local patterns present in each video frame. The evolution of these mid-level features is learned by a Recurrent Neural Network trained to classify each sequence. Experimental results on the KTH dataset show that the proposed approach outperforms existing models which rely on learned-features, and gives comparable results with the best related works.
Keywords :
feature extraction; gesture recognition; image classification; image representation; image sequences; learning (artificial intelligence); recurrent neural nets; 2D local patterns; KTH dataset; convolutional sparse autoencoder; human action recognition; learned-features; manually engineered features; midlevel features; recurrent neural network; sequence classification; sequence learning; sparse shift-invariant local 2D pattern representation; video frame; Convolutional codes; Decoding; Feature extraction; Humans; Pattern recognition; Recurrent neural networks; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
ISSN :
1051-4651
Print_ISBN :
978-1-4673-2216-4
Type :
conf
Filename :
6460998
Link To Document :
بازگشت