Title :
Stacks of convolutional Restricted Boltzmann Machines for shift-invariant feature learning
Author :
Norouzi, Mohammad ; Ranjbar, M. ; Mori, Greg
Author_Institution :
Sch. of Comput. Sci., Simon Fraser Univ., Burnaby, BC, Canada
Abstract :
In this paper we present a method for learning class-specific features for recognition. Recently a greedy layer-wise procedure was proposed to initialize weights of deep belief networks, by viewing each layer as a separate restricted Boltzmann machine (RBM). We develop the convolutional RBM (C-RBM), a variant of the RBM model in which weights are shared to respect the spatial structure of images. This framework learns a set of features that can generate the images of a specific object class. Our feature extraction model is a four layer hierarchy of alternating filtering and maximum subsampling. We learn feature parameters of the first and third layers viewing them as separate C-RBMs. The outputs of our feature extraction hierarchy are then fed as input to a discriminative classifier. It is experimentally demonstrated that the extracted features are effective for object detection, using them to obtain performance comparable to the state of the art on handwritten digit recognition and pedestrian detection.
Keywords :
Boltzmann machines; belief networks; feature extraction; learning (artificial intelligence); object recognition; pattern classification; C-RBM; belief networks; convolutional restricted Boltzmann machine; discriminative classifier; feature extraction model; handwritten digit recognition; images spatial structure; pedestrian detection; shift invariant feature learning; Cellular neural networks; Computer vision; Feature extraction; Filters; Large-scale systems; Machine learning; Neural networks; Nonhomogeneous media; Object detection; Object recognition;
Conference_Titel :
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-3992-8
DOI :
10.1109/CVPR.2009.5206577