مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2173290

Title :

Images as bags of pixels

Author :

Jebara, Tony

Author_Institution :

Dept. of Comput. Sci., Columbia Univ., New York, NY, USA

fYear :

2003

fDate :

13-16 Oct. 2003

Firstpage :

265

Abstract :

We propose modeling images and related visual objects as bags of pixels or sets of vectors. For instance, gray scale images are modeled as a collection or bag of (X, Y, I) pixel vectors. This representation implies a permutational invariance over the bag of pixels, which is naturally handled by endowing each image with a permutation matrix. Each matrix permits the image to span a manifold of multiple configurations, capturing the vector set´s invariance to orderings or permutation transformations. Permutation configurations are optimized while jointly modeling many images via maximum likelihood. The solution is a uniquely solvable convex program, which computes correspondence simultaneously for all images (as opposed to traditional pairwise correspondence solutions). Maximum likelihood performs a nonlinear dimensionality reduction, choosing permutations that compact the permuted image vectors into a volumetrically minimal subspace. This is highly suitable for principal components analysis which, when applied to the permutationally invariant bag of pixels representation, outperforms PCA on appearance-based vectorization by orders of magnitude. Furthermore, the bag of pixels subspace benefits from automatic correspondence estimation, giving rise to meaningful linear variations such as morphings, translations, and jointly spatio-textural image transformations. Results are shown for several datasets.

Keywords :

image reconstruction; image representation; maximum likelihood estimation; principal component analysis; vectors; gray scale image modelling; image transformations; maximum likelihood estimation; permutation matrix; permutational invariance; permuted image pixel vector representation; principal components analysis; Computer vision; Pixel;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on

Conference_Location :

Nice, France

Print_ISBN :

0-7695-1950-4

Type :

conf

DOI :

10.1109/ICCV.2003.1238352

Filename :

1238352

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2173290