مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised learning of translation invariant occlusive components

DocumentCode :

2716826

Title :

Unsupervised learning of translation invariant occlusive components

Author :

Dai, Zhenwen ; Lücke, Jörg

fYear :

2012

fDate :

16-21 June 2012

Firstpage :

2400

Lastpage :

2407

Abstract :

We study unsupervised learning of occluding objects in images of visual scenes. The derived learning algorithm is based on a probabilistic generative model which parameterizes object shapes, object features and the background. No assumptions are made for the object orders in depth or the objects´ planar positions. Parameter optimization is thus subject to the large combinatorics of depth orders and positions. Previous approaches constrained this combinatorics but were still only able to learn a very small number of objects. By applying a novel variational EM approach, we show that even without constraints on the object combinatorics, a relatively large number of objects can be learned. In different numerical experiments, our unsupervised approach extracts explicit object representations with object masks and object features closely aligned with the true objects in the scenes. We investigate the robustness of the approach and the use of the learned representations for inference. Furthermore, we demonstrate generality of the approach by applying it to grayscale images, color-vector images, and Gabor-vector images as well as to motion trajectory data for which the extracted components correspond to motion primitives.

Keywords :

computer graphics; image colour analysis; image motion analysis; inference mechanisms; optimisation; probability; unsupervised learning; Gabor-vector images; background parameterization; color-vector images; explicit object representation extraction; grayscale images; inference; motion primitives; motion trajectory data; object combinatorics; object features parameterization; object masks; object occlusion; object shape parameterization; parameter optimization; probabilistic generative model; translation invariant occlusive components; unsupervised learning; variational EM approach; visual scene image; Annealing; Approximation methods; Feature extraction; Optimization; Vectors; Videos; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

Conference_Location :

Providence, RI

ISSN :

1063-6919

Print_ISBN :

978-1-4673-1226-4

Electronic_ISBN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2012.6247953

Filename :

6247953

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2716826