DocumentCode
178814
Title
Automatic Segmentation and Recognition of Human Actions in Monocular Sequences
Author
Orrite, C. ; Rodriguez, M. ; Herrero, E. ; Rogez, G. ; Velastin, S.A.
Author_Institution
Aragon Inst. of Eng. Res., Univ. of Zaragoza, Zaragoza, Spain
fYear
2014
fDate
24-28 Aug. 2014
Firstpage
4218
Lastpage
4223
Abstract
This paper addresses the problem of silhouette-based human action segmentation and recognition in monocular sequences. Motion History Images (MHIs), used as 2D templates, capture motion information by encoding where and when motion occurred in the images. Inspired by codebook approaches for object and scene categorization, we first construct a codebook of temporal motion templates by clustering all the MHIs of each particular action. These MHIs capture different actors, speeds and a wide range of camera viewpoints. In this paper, we use a Kohonen´s Self-Organizing Map (SOM) to simultaneously cluster the MHI templates and represent them in lower dimensional subspaces. To cope with temporal segmentation, and concurrently carry out action recognition, a new architecture is proposed where the obsrvation MHIs are projected onto all these action-specific manifolds and the Euclidean distance between each MHI and the nearest cluster within each action-manifold constitutes the observation vector of a Markov Model. To estimate the state/action at each time step, we introduce a new method based on Observable Markov Models (OMMs) where the Markov model is augmented with a neutral state. The combination of our action-specific manifolds with the augmented OMM allows to automatically segment and recognize long sequences of consecutive actions, without any prior knowledge about initial and ending frames of each action. Importantly, our method allows to interpolate betweeen training viewpoint and recognizes actions, independently of the camera viewpoint, even from unseen viewpoints.
Keywords
Markov processes; cameras; image coding; image recognition; image segmentation; image sequences; interpolation; motion estimation; self-organising feature maps; video signal processing; 2D templates; Euclidean distance; MHI template clustering; OMM; SOM; action estimation; action-specific manifolds; automatic silhouette-based human action recognition; automatic silhouette-based human action segmentation; camera viewpoint; information encoding; interpolation; low-dimensional subspace representation; monocular sequences; motion history images; motion information capture; nearest cluster; neutral state; observable Markov models; observation vector; self-organizing map; state estimation; temporal motion template codebook; temporal segmentation; training viewpoint; unseen viewpoints; Cameras; Hidden Markov models; Kernel; Manifolds; Markov processes; Motion segmentation; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location
Stockholm
ISSN
1051-4651
Type
conf
DOI
10.1109/ICPR.2014.723
Filename
6977435
Link To Document