DocumentCode
8404
Title
Categorizing Dynamic Textures Using a Bag of Dynamical Systems
Author
Ravichandran, Arunkumar ; Chaudhry, Rizwan ; Vidal, Rene
Author_Institution
UCLA Vision Lab., Univ. of California, Los Angeles, Los Angeles, CA, USA
Volume
35
Issue
2
fYear
2013
fDate
Feb. 2013
Firstpage
342
Lastpage
353
Abstract
We consider the problem of categorizing video sequences of dynamic textures, i.e., nonrigid dynamical objects such as fire, water, steam, flags, etc. This problem is extremely challenging because the shape and appearance of a dynamic texture continuously change as a function of time. State-of-the-art dynamic texture categorization methods have been successful at classifying videos taken from the same viewpoint and scale by using a Linear Dynamical System (LDS) to model each video, and using distances or kernels in the space of LDSs to classify the videos. However, these methods perform poorly when the video sequences are taken under a different viewpoint or scale. In this paper, we propose a novel dynamic texture categorization framework that can handle such changes. We model each video sequence with a collection of LDSs, each one describing a small spatiotemporal patch extracted from the video. This Bag-of-Systems (BoS) representation is analogous to the Bag-of-Features (BoF) representation for object recognition, except that we use LDSs as feature descriptors. This choice poses several technical challenges in adopting the traditional BoF approach. Most notably, the space of LDSs is not euclidean; hence, novel methods for clustering LDSs and computing codewords of LDSs need to be developed. We propose a framework that makes use of nonlinear dimensionality reduction and clustering techniques combined with the Martin distance for LDSs to tackle these issues. Our experiments compare the proposed BoS approach to existing dynamic texture categorization methods and show that it can be used for recognizing dynamic textures in challenging scenarios which could not be handled by existing methods.
Keywords
feature extraction; image sequences; image texture; pattern clustering; video signal processing; BoF; BoS; LDS; Martin distance; bag-of-features representation; bag-of-systems representation; clustering techniques; codewords; dynamic texture categorization methods; feature descriptors; fire; flags; linear dynamical system; nonlinear dimensionality reduction; nonrigid dynamical objects; object recognition; spatiotemporal patch; steam; video classification; video sequence categorization; water; Feature extraction; Heuristic algorithms; Measurement; Observability; Spatiotemporal phenomena; Training; Video sequences; Dynamic textures; categorization; linear dynamical systems; Algorithms; Artificial Intelligence; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2012.83
Filename
6178260
Link To Document