• DocumentCode
    443134
  • Title

    Perceptual scale space and its applications

  • Author

    Wang, Yizhou ; Bahrami, Siavosh ; Zhu, Song-Chun

  • Author_Institution
    Dept. of Comput. Sci., California Univ., Los Angeles, CA, USA
  • Volume
    1
  • fYear
    2005
  • fDate
    17-21 Oct. 2005
  • Firstpage
    58
  • Abstract
    In this paper, we study a perceptual scale space by constructing a so-called sketch pyramid which augments the Gaussian and Laplacian pyramid representations in traditional image scale space theory. Each level of this sketch pyramid is a generic attributed graph - called the primal sketch which is inferred from the corresponding image at the same level of the Gaussian pyramid. When images are viewed at increasing resolutions, more details are revealed. This corresponds to perceptual transitions which are represented by topological changes in the sketch graph in terms of a graph grammar. We compute the sketch or perceptual pyramid by Bayesian inference upwards-downwards the pyramid using Markov chain Monte Carlo reversible jumps. We show two example applications of this perceptual scale space: (1) motion tracking of objects over scales, and (2) adaptive image displays which can efficiently show a large high resolution image in a small screen (of a PDA for example) through a selective tour of its image pyramid. Other potential applications include super resolution and multiresolution object recognition.
  • Keywords
    Bayes methods; Gaussian processes; Markov processes; Monte Carlo methods; graph grammars; image representation; image resolution; tracking; Bayesian inference; Gaussian pyramid representation; Laplacian pyramid representation; Markov chain; Monte Carlo; adaptive image display; generic attributed graph; graph grammar; image pyramid; image resolution; image scale space theory; multiresolution object recognition; object motion tracking; perceptual pyramid; perceptual scale space; primal sketch graph; sketch pyramid; Application software; Bayesian methods; Computer science; Dictionaries; Image resolution; Laplace equations; Large screen displays; Monte Carlo methods; Statistics; Tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on
  • ISSN
    1550-5499
  • Print_ISBN
    0-7695-2334-X
  • Type

    conf

  • DOI
    10.1109/ICCV.2005.187
  • Filename
    1541239