DocumentCode :
2604006
Title :
Two-person interaction detection using body-pose features and multiple instance learning
Author :
Yun, Kiwon ; Honorio, Jean ; Chattopadhyay, Debaleena ; Berg, Tamara L. ; Samaras, Dimitris
Author_Institution :
Stony Brook Univ., Stony Brook, NY, USA
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
28
Lastpage :
35
Abstract :
Human activity recognition has potential to impact a wide range of applications from surveillance to human computer interfaces to content based video retrieval. Recently, the rapid development of inexpensive depth sensors (e.g. Microsoft Kinect) provides adequate accuracy for real-time full-body human tracking for activity recognition applications. In this paper, we create a complex human activity dataset depicting two person interactions, including synchronized video, depth and motion capture data. Moreover, we use our dataset to evaluate various features typically used for indexing and retrieval of motion capture data, in the context of real-time detection of interaction activities via Support Vector Machines (SVMs). Experimentally, we find that the geometric relational features based on distance between all pairs of joints outperforms other feature choices. For whole sequence classification, we also explore techniques related to Multiple Instance Learning (MIL) in which the sequence is represented by a bag of body-pose features. We find that the MIL based classifier outperforms SVMs when the sequences extend temporally around the interaction of interest.
Keywords :
feature extraction; image classification; image motion analysis; image sensors; image sequences; information retrieval; learning (artificial intelligence); object tracking; pose estimation; support vector machines; synchronisation; video signal processing; video surveillance; MIL based classifier; SVM; body-pose features; complex human activity dataset; content based video retrieval; depth capture data; geometric relational features; human activity recognition; human computer interfaces; inexpensive depth sensors; motion capture data indexing; motion capture data retrieval; multiple instance learning; real-time full-body human tracking; real-time interaction activity detection; sequence classification; sequence representation; support vector machines; two-person interaction detection; video surveillance; video synchronization; Feature extraction; Humans; Joints; Real time systems; Sensors; Tracking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition Workshops (CVPRW), 2012 IEEE Computer Society Conference on
Conference_Location :
Providence, RI
ISSN :
2160-7508
Print_ISBN :
978-1-4673-1611-8
Electronic_ISBN :
2160-7508
Type :
conf
DOI :
10.1109/CVPRW.2012.6239234
Filename :
6239234
Link To Document :
بازگشت