• DocumentCode
    2716826
  • Title

    Unsupervised learning of translation invariant occlusive components

  • Author

    Dai, Zhenwen ; Lücke, Jörg

  • fYear
    2012
  • fDate
    16-21 June 2012
  • Firstpage
    2400
  • Lastpage
    2407
  • Abstract
    We study unsupervised learning of occluding objects in images of visual scenes. The derived learning algorithm is based on a probabilistic generative model which parameterizes object shapes, object features and the background. No assumptions are made for the object orders in depth or the objects´ planar positions. Parameter optimization is thus subject to the large combinatorics of depth orders and positions. Previous approaches constrained this combinatorics but were still only able to learn a very small number of objects. By applying a novel variational EM approach, we show that even without constraints on the object combinatorics, a relatively large number of objects can be learned. In different numerical experiments, our unsupervised approach extracts explicit object representations with object masks and object features closely aligned with the true objects in the scenes. We investigate the robustness of the approach and the use of the learned representations for inference. Furthermore, we demonstrate generality of the approach by applying it to grayscale images, color-vector images, and Gabor-vector images as well as to motion trajectory data for which the extracted components correspond to motion primitives.
  • Keywords
    computer graphics; image colour analysis; image motion analysis; inference mechanisms; optimisation; probability; unsupervised learning; Gabor-vector images; background parameterization; color-vector images; explicit object representation extraction; grayscale images; inference; motion primitives; motion trajectory data; object combinatorics; object features parameterization; object masks; object occlusion; object shape parameterization; parameter optimization; probabilistic generative model; translation invariant occlusive components; unsupervised learning; variational EM approach; visual scene image; Annealing; Approximation methods; Feature extraction; Optimization; Vectors; Videos; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4673-1226-4
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2012.6247953
  • Filename
    6247953