DocumentCode :
3458119
Title :
Learning class-specific image transformations with higher-order Boltzmann machines
Author :
Huang, Gary B. ; Learned-Miller, Erik
Author_Institution :
Univ. of Massachusetts Amherst, Amherst, MA, USA
fYear :
2010
fDate :
13-18 June 2010
Firstpage :
25
Lastpage :
32
Abstract :
In this paper, we examine the problem of learning a representation of image transformations specific to a complex object class, such as faces. Learning such a representation for a specific object class would allow us to perform improved, pose-invariant visual verification, such as unconstrained face verification. We build off of the method of using factored higher-order Boltzmann machines to model such image transformations. Using this approach will potentially enable us to use the model as one component of a larger deep architecture. This will allow us to use the feature information in an ordinary deep network to perform better modeling of transformations, and to infer pose estimates from the hidden representation. We focus on applying these higher-order Boltzmann machines to the NORB 3D objects data set and the Labeled Faces in the Wild face data set. We first show two different approaches to using this method on these object classes, demonstrating that while some useful transformation information can be extracted, ultimately the simple direct application of these models to higher-resolution, complex object classes is insufficient to achieve improved visual verification performance. Instead, we believe that this method should be integrated into a larger deep architecture, and show initial results using the higher-order Boltzmann machine as the second layer of a deep architecture, above a first layer convolutional RBM.
Keywords :
Boltzmann machines; feature extraction; image representation; learning (artificial intelligence); NORB 3D objects data set; class specific image transformations learning; factored higher order Boltzmann machines; first layer convolutional RBM; image representation; pose estimation; pose invariant visual verification; transformation information extraction; unconstrained face verification; visual verification performance; wild face data set; Airplanes; Bicycles; Data mining; Face detection; Face recognition; Filters; Head; Machine learning; Object recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on
Conference_Location :
San Francisco, CA
ISSN :
2160-7508
Print_ISBN :
978-1-4244-7029-7
Type :
conf
DOI :
10.1109/CVPRW.2010.5543185
Filename :
5543185
Link To Document :
بازگشت