Title :
Estimating mixture models of images and inferring spatial transformations using the EM algorithm
Author :
Frey, Brendan J. ; Jojic, Nebojsa
Author_Institution :
Beckman Inst. for Adv. Sci. & Technol., Illinois Univ., Urbana, IL, USA
Abstract :
Mixture modeling and clustering algorithms are effective, simple ways to represent images using a set of data centers. However, in situations where the images include background clutter and transformations such as translation, rotation, shearing and warping, these methods extract data centers that include clutter and represent different transformations of essentially the same data. Taking face images as an example, it would be more useful for the different clusters to represent different poses and expressions, instead of cluttered versions of different translations, scales and rotations. By including clutter and transformation as unobserved, latent variables in a mixture model, we obtain a new “transformed mixture of Gaussians”, which is invariant to a specified set of transformations. We show how a linear-time EM algorithm can be used to fit this model by jointly estimating a mixture model for the data and inferring the transformation for each image. We show that this algorithm can jointly align images of a human head and learn different poses. We also find that the algorithm performs better than k-nearest neighbors and mixtures of Gaussians on handwritten digit recognition.
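The joint estimation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a much simpler model in which each observation is a cyclically shifted cluster mean plus isotropic Gaussian noise with one shared variance, it omits the clutter model, and it restricts the transformation set to cyclic translations of a flattened signal. The function name `fit_tmg` and all parameter names are hypothetical.

```python
import numpy as np

def fit_tmg(X, n_clusters, shifts, n_iter=30, seed=0):
    """EM for a simplified transformed mixture (an assumption of this sketch):
    x ~ N(roll(mu_c, s), var * I), with latent cluster c and latent shift s."""
    X = np.asarray(X, dtype=float)
    shifts = list(shifts)
    rng = np.random.default_rng(seed)
    n, d = X.shape
    # Initialise the means from randomly chosen data points.
    mu = X[rng.choice(n, n_clusters, replace=False)].copy()
    var = X.var() + 1e-6
    # Uniform prior over every (cluster, shift) pair.
    log_pi = np.full((n_clusters, len(shifts)),
                     -np.log(n_clusters * len(shifts)))
    # Data with each candidate shift undone: shape (n, L, d).
    aligned = np.stack([np.roll(X, -s, axis=1) for s in shifts], axis=1)

    def sq_dist(mu):
        # Squared distance from each sample to each shifted mean: (n, C, L).
        templates = np.stack([np.roll(mu, s, axis=1) for s in shifts], axis=1)
        return ((X[:, None, None, :] - templates[None]) ** 2).sum(-1)

    for _ in range(n_iter):
        # E-step: posterior over (cluster, shift) for every sample.
        log_r = log_pi[None] - 0.5 * sq_dist(mu) / var
        log_r -= log_r.max(axis=(1, 2), keepdims=True)
        r = np.exp(log_r)
        r /= r.sum(axis=(1, 2), keepdims=True)
        # M-step: average the un-shifted data, weighted by responsibility.
        nk = r.sum(axis=0)  # (C, L)
        mu = np.einsum('ncl,nld->cd', r, aligned)
        mu /= nk.sum(axis=1, keepdims=True) + 1e-12
        var = (r * sq_dist(mu)).sum() / (n * d) + 1e-6
        log_pi = np.log(nk / n + 1e-12)
    return mu, var, log_pi, r
```

In this sketch each EM iteration costs time proportional to the number of samples times the number of clusters, shifts and pixels. For each sample, the inferred transformation is the shift with the largest responsibility, for example `r[i].sum(axis=0).argmax()`.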
Keywords :
clutter; computer vision; handwritten character recognition; motion estimation; Gaussians; background clutter; data centers; handwritten digit recognition; images estimation; k-nearest neighbors; linear-time EM algorithm; mixture models; rotation; shearing; spatial transformations; translation; warping; Clustering algorithms; Coherence; Gaussian noise; Gaussian processes; Head; Humans; Image recognition; Pixel; Shearing; Video sequences;
Conference_Title :
Computer Vision and Pattern Recognition, 1999. IEEE Computer Society Conference on.
Conference_Location :
Fort Collins, CO
Print_ISBN :
0-7695-0149-4
DOI :
10.1109/CVPR.1999.786972