• DocumentCode
    8404
  • Title

    Categorizing Dynamic Textures Using a Bag of Dynamical Systems

  • Author

    Ravichandran, Arunkumar ; Chaudhry, Rizwan ; Vidal, Rene

  • Author_Institution
    UCLA Vision Lab., Univ. of California, Los Angeles, Los Angeles, CA, USA
  • Volume
    35
  • Issue
    2
  • fYear
    2013
  • fDate
    Feb. 2013
  • Firstpage
    342
  • Lastpage
    353
  • Abstract
    We consider the problem of categorizing video sequences of dynamic textures, i.e., nonrigid dynamical objects such as fire, water, steam, flags, etc. This problem is extremely challenging because the shape and appearance of a dynamic texture continuously change as a function of time. State-of-the-art dynamic texture categorization methods have been successful at classifying videos taken from the same viewpoint and scale by using a Linear Dynamical System (LDS) to model each video, and using distances or kernels in the space of LDSs to classify the videos. However, these methods perform poorly when the video sequences are taken under a different viewpoint or scale. In this paper, we propose a novel dynamic texture categorization framework that can handle such changes. We model each video sequence with a collection of LDSs, each one describing a small spatiotemporal patch extracted from the video. This Bag-of-Systems (BoS) representation is analogous to the Bag-of-Features (BoF) representation for object recognition, except that we use LDSs as feature descriptors. This choice poses several technical challenges in adopting the traditional BoF approach. Most notably, the space of LDSs is not euclidean; hence, novel methods for clustering LDSs and computing codewords of LDSs need to be developed. We propose a framework that makes use of nonlinear dimensionality reduction and clustering techniques combined with the Martin distance for LDSs to tackle these issues. Our experiments compare the proposed BoS approach to existing dynamic texture categorization methods and show that it can be used for recognizing dynamic textures in challenging scenarios which could not be handled by existing methods.
  • Keywords
    feature extraction; image sequences; image texture; pattern clustering; video signal processing; BoF; BoS; LDS; Martin distance; bag-of-features representation; bag-of-systems representation; clustering techniques; codewords; dynamic texture categorization methods; feature descriptors; fire; flags; linear dynamical system; nonlinear dimensionality reduction; nonrigid dynamical objects; object recognition; spatiotemporal patch; steam; video classification; video sequence categorization; water; Feature extraction; Heuristic algorithms; Measurement; Observability; Spatiotemporal phenomena; Training; Video sequences; Dynamic textures; categorization; linear dynamical systems; Algorithms; Artificial Intelligence; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2012.83
  • Filename
    6178260