DocumentCode
2290001
Title
Automatic annotation of human actions in video
Author
Duchenne, Olivier ; Laptev, Ivan ; Sivic, Josef ; Bach, Francis ; Ponce, Jean
Author_Institution
INRIA /Ã\x89cole Normale Supérieure, Paris, France
fYear
2009
fDate
Sept. 29 2009-Oct. 2 2009
Firstpage
1491
Lastpage
1498
Abstract
This paper addresses the problem of automatic temporal annotation of realistic human actions in video using minimal manual supervision. To this end we consider two associated problems: (a) weakly-supervised learning of action models from readily available annotations, and (b) temporal localization of human actions in test videos. To avoid the prohibitive cost of manual annotation for training, we use movie scripts as a means of weak supervision. Scripts, however, provide only implicit, noisy, and imprecise information about the type and location of actions in video. We address this problem with a kernel-based discriminative clustering algorithm that locates actions in the weakly-labeled training data. Using the obtained action samples, we train temporal action detectors and apply them to locate actions in the raw video data. Our experiments demonstrate that the proposed method for weakly-supervised learning of action models leads to significant improvement in action detection. We present detection results for three action classes in four feature length movies with challenging and realistic video data.
Keywords
Application software; Clustering algorithms; Clustering methods; Computer vision; Costs; Detectors; Humans; Motion pictures; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision, 2009 IEEE 12th International Conference on
Conference_Location
Kyoto
ISSN
1550-5499
Print_ISBN
978-1-4244-4420-5
Electronic_ISBN
1550-5499
Type
conf
DOI
10.1109/ICCV.2009.5459279
Filename
5459279
Link To Document