Title :
Building a better probabilistic model of images by factorization
Author :
Culpepper, Benjamin J. ; Sohl-Dickstein, Jascha ; Olshausen, Bruno A.
Author_Institution :
UC Berkeley, Berkeley, CA, USA
Abstract :
We describe a directed bilinear model that learns higher-order groupings among features of natural images. The model represents images in terms of two sets of latent variables: one set of variables represents which feature groups are active, while the other specifies the relative activity within groups. Such a factorized representation is beneficial because it is stable in response to small variations in the placement of features while still preserving information about relative spatial relationships. When trained on MNIST digits, the resulting representation provides state of the art performance in classification using a simple classifier. When trained on natural images, the model learns to group features according to proximity in position, orientation, and scale. The model achieves high log-likelihood (-94 nats), surpassing the current state of the art for natural images achievable with an mcRBM model.
Keywords :
image classification; image representation; probability; MNIST digits; directed bilinear model; factorized representation; higher order grouping; latent variables; log likelihood; natural image classification; probabilistic model; relative spatial relationships; Accuracy; Computational modeling; Data models; Kernel; Mathematical model; Training; Vectors;
Conference_Titel :
Computer Vision (ICCV), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4577-1101-5
DOI :
10.1109/ICCV.2011.6126473