DocumentCode
62619
Title
Classification of Cinematographic Shots Using Lie Algebra and its Application to Complex Event Recognition
Author
Bhattacharya, Surya ; Mehran, Ramin ; Sukthankar, Rahul ; Shah, Mubarak
Author_Institution
Center of Res. in Comput. Vision, Univ. of Central Florida, Orlando, FL, USA
Volume
16
Issue
3
fYear
2014
fDate
Apr-14
Firstpage
686
Lastpage
696
Abstract
In this paper, we propose a discriminative representation of a video shot based on its camera motion and demonstrate how the representation can be used for high level multimedia tasks like complex event recognition. In our technique, we assume that a homography exists between a pair of subsequent frames in a given shot. Using purely image-based methods, we compute homography parameters that serve as coarse indicators of the ambient camera motion. Next, using Lie algebra, we map the homography matrices to an intermediate vector space that preserves the intrinsic geometric structure of the transformation. The mappings are stacked temporally to generate vector time-series per shot. To extract meaningful features from time-series, we propose an efficient linear dynamical system based technique. The extracted temporal features are further used to train linear SVMs as classifiers for a particular shot class. In addition to demonstrating the efficacy of our method on a novel dataset, we extend its applicability to recognize complex events in large scale videos under unconstrained scenarios. Our empirical evaluations on eight cinematographic shot classes show that our technique performs close to approaches that involve extraction of 3-D trajectories using computationally prohibitive structure from motion techniques.
Keywords
Lie algebras; cinematography; feature extraction; image classification; image motion analysis; image representation; matrix algebra; support vector machines; time series; video signal processing; 3D trajectory; Lie algebra; camera motion; cinematographic shot classification; complex event recognition; high level multimedia task; homography matrices; image-based method; intermediate vector space; intrinsic geometric structure; large scale video; linear SVM; linear dynamical system; temporal feature; vector time-series; video shot; Cameras; Computer vision; Cranes; Feature extraction; Support vector machine classification; Vectors; Cinematographic shots; homography; lie algebra; multimedia event recognition; shot classification;
fLanguage
English
Journal_Title
Multimedia, IEEE Transactions on
Publisher
ieee
ISSN
1520-9210
Type
jour
DOI
10.1109/TMM.2014.2300833
Filename
6714427
Link To Document