• DocumentCode
    2714505
  • Title

    Dynamic scene understanding: The role of orientation features in space and time in scene classification

  • Author

    Derpanis, Konstantinos G. ; Lecce, Matthieu ; Daniilidis, Kostas ; Wildes, Richard P.

  • Author_Institution
    GRASP Lab., Univ. of Pennsylvania, Philadelphia, PA, USA
  • fYear
    2012
  • fDate
    16-21 June 2012
  • Firstpage
    1306
  • Lastpage
    1313
  • Abstract
    Natural scene classification is a fundamental challenge in computer vision. By far, the majority of studies have limited their scope to scenes from single image stills and thereby ignore potentially informative temporal cues. The current paper is concerned with determining the degree of performance gain in considering short videos for recognizing natural scenes. Towards this end, the impact of multiscale orientation measurements on scene classification is systematically investigated, as related to: (i) spatial appearance, (ii) temporal dynamics and (iii) joint spatial appearance and dynamics. These measurements in visual space, x-y, and spacetime, x-y-t, are recovered by a bank of spatiotemporal oriented energy filters. In addition, a new data set is introduced that contains 420 image sequences spanning fourteen scene categories, with temporal scene information due to objects and surfaces decoupled from camera-induced ones. This data set is used to evaluate classification performance of the various orientation-related representations, as well as state-of-the-art alternatives. It is shown that a notable performance increase is realized by spatiotemporal approaches in comparison to purely spatial or purely temporal methods.
  • Keywords
    computer vision; image classification; image representation; image sequences; computer vision; dynamic scene; image sequences; informative temporal cues; joint spatial appearance; multiscale orientation measurement; natural scene classification; natural scene recognition; orientation-related representation; performance gain; short videos; spatiotemporal oriented energy filter; temporal dynamics; temporal method; Dynamics; Energy measurement; Image sequences; Layout; Spatiotemporal phenomena; Videos; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4673-1226-4
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2012.6247815
  • Filename
    6247815