مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning class-specific image transformations with higher-order Boltzmann machines

DocumentCode :

3458119

Title :

Learning class-specific image transformations with higher-order Boltzmann machines

Author :

Huang, Gary B. ; Learned-Miller, Erik

Author_Institution :

Univ. of Massachusetts Amherst, Amherst, MA, USA

fYear :

2010

fDate :

13-18 June 2010

Firstpage :

Lastpage :

Abstract :

In this paper, we examine the problem of learning a representation of image transformations specific to a complex object class, such as faces. Learning such a representation for a specific object class would allow us to perform improved, pose-invariant visual verification, such as unconstrained face verification. We build off of the method of using factored higher-order Boltzmann machines to model such image transformations. Using this approach will potentially enable us to use the model as one component of a larger deep architecture. This will allow us to use the feature information in an ordinary deep network to perform better modeling of transformations, and to infer pose estimates from the hidden representation. We focus on applying these higher-order Boltzmann machines to the NORB 3D objects data set and the Labeled Faces in the Wild face data set. We first show two different approaches to using this method on these object classes, demonstrating that while some useful transformation information can be extracted, ultimately the simple direct application of these models to higher-resolution, complex object classes is insufficient to achieve improved visual verification performance. Instead, we believe that this method should be integrated into a larger deep architecture, and show initial results using the higher-order Boltzmann machine as the second layer of a deep architecture, above a first layer convolutional RBM.

Keywords :

Boltzmann machines; feature extraction; image representation; learning (artificial intelligence); NORB 3D objects data set; class specific image transformations learning; factored higher order Boltzmann machines; first layer convolutional RBM; image representation; pose estimation; pose invariant visual verification; transformation information extraction; unconstrained face verification; visual verification performance; wild face data set; Airplanes; Bicycles; Data mining; Face detection; Face recognition; Filters; Head; Machine learning; Object recognition; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on

Conference_Location :

San Francisco, CA

ISSN :

2160-7508

Print_ISBN :

978-1-4244-7029-7

Type :

conf

DOI :

10.1109/CVPRW.2010.5543185

Filename :

5543185

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3458119