DocumentCode :
2716826
Title :
Unsupervised learning of translation invariant occlusive components
Author :
Dai, Zhenwen ; Lücke, Jörg
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
2400
Lastpage :
2407
Abstract :
We study unsupervised learning of occluding objects in images of visual scenes. The derived learning algorithm is based on a probabilistic generative model which parameterizes object shapes, object features and the background. No assumptions are made for the object orders in depth or the objects´ planar positions. Parameter optimization is thus subject to the large combinatorics of depth orders and positions. Previous approaches constrained this combinatorics but were still only able to learn a very small number of objects. By applying a novel variational EM approach, we show that even without constraints on the object combinatorics, a relatively large number of objects can be learned. In different numerical experiments, our unsupervised approach extracts explicit object representations with object masks and object features closely aligned with the true objects in the scenes. We investigate the robustness of the approach and the use of the learned representations for inference. Furthermore, we demonstrate generality of the approach by applying it to grayscale images, color-vector images, and Gabor-vector images as well as to motion trajectory data for which the extracted components correspond to motion primitives.
Keywords :
computer graphics; image colour analysis; image motion analysis; inference mechanisms; optimisation; probability; unsupervised learning; Gabor-vector images; background parameterization; color-vector images; explicit object representation extraction; grayscale images; inference; motion primitives; motion trajectory data; object combinatorics; object features parameterization; object masks; object occlusion; object shape parameterization; parameter optimization; probabilistic generative model; translation invariant occlusive components; unsupervised learning; variational EM approach; visual scene image; Annealing; Approximation methods; Feature extraction; Optimization; Vectors; Videos; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2012.6247953
Filename :
6247953
Link To Document :
بازگشت