DocumentCode :
2998803
Title :
Contextual Action Recognition in Multi-sensor Nighttime Video Sequences
Author :
Haq, Anwaar-ul ; Gondal, Iqbal ; Murshed, Manzur
Author_Institution :
GSIT, Monash Univ., Melbourne, VIC, Australia
fYear :
2011
fDate :
6-8 Dec. 2011
Firstpage :
256
Lastpage :
261
Abstract :
Contextual information is important for interpreting human actions especially when actions exhibit interactive relationship with their context. Contextual clues become even more crucial when videos are captured in unfavorable conditions like extreme low light nighttime scenarios. These conditions encourage the use of multi-senor imagery and context enhancement. In this paper, we explore the importance of contextual knowledge for recognizing human actions in multi-sensor nighttime videos. Information fusion is utilized for encapsulating visual information about actions and their context. Space-time action information is contained using 3D fourier transform of fused action silhouette volume. In parallel, SIFT context images are extracted and fused using principal component analysis based feature fusion for each action class. Contextual dissimilarity is penalized by minimizing context SIFT flow energy. The action dataset comprises multi-sensor night vision video data from infra-red and visible spectrum. Experimental results show that fused contextual action information boost action recognition performance as compared to the baseline action recognition approach.
Keywords :
Fourier transforms; data encapsulation; feature extraction; gesture recognition; image enhancement; image fusion; image sequences; infrared spectra; principal component analysis; video signal processing; visible spectra; 3D Fourier transform; SIFT context images; action dataset; context SIFT flow energy; context enhancement; contextual action recognition; contextual clues; contextual dissimilarity; contextual information; contextual knowledge; encapsulating visual information; extreme low light nighttime scenarios; feature fusion; fused action silhouette volume; human actions interpretation; information fusion; interactive relationship; multisensor imagery; multisensor nighttime video sequences; multisensor nighttime videos; principal component analysis; space-time action information; unfavorable conditions; Context; Humans; Image color analysis; Sensors; Streaming media; Video sequences; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Image Computing Techniques and Applications (DICTA), 2011 International Conference on
Conference_Location :
Noosa, QLD
Print_ISBN :
978-1-4577-2006-2
Type :
conf
DOI :
10.1109/DICTA.2011.49
Filename :
6128691
Link To Document :
بازگشت