DocumentCode
2491053
Title
A Large-Scale Architecture for Restricted Boltzmann Machines
Author
Kim, Sang Kyun ; McMahon, Peter L. ; Olukotun, Kunle
Author_Institution
Dept. of Electr. Eng., Stanford Univ., Stanford, CA, USA
fYear
2010
fDate
2-4 May 2010
Firstpage
201
Lastpage
208
Abstract
Deep Belief Nets (DBNs) are an emerging application in the machine learning domain, which use Restricted Boltzmann Machines (RBMs) as their basic building block. Although small scale DBNs have shown great potential, the computational cost of RBM training has been a major challenge in scaling to large networks. In this paper we present a highly scalable architecture for Deep Belief Net processing on hardware systems that can handle hundreds of boards, if not more, of customized logic with near linear performance increase. We elucidate tradeoffs between flexibility in the neuron connections, and the hardware resources, such as memory and communication bandwidth, required to build a custom processor design that has optimal efficiency. We illustrate how our architecture can easily support sparse networks with dense regions of connections between neighboring sets of neurons, which is relevant to applications where there are obvious spatial correlations in the data, such as in image processing. We demonstrate the feasibility of our approach by implementing a multi-FPGA system. We show that a speedup of 46X-112X over an optimized single core CPU implementation can be achieved for a four-FPGA implementation.
Keywords
Boltzmann machines; field programmable gate arrays; learning (artificial intelligence); multilayer perceptrons; RBM training; customized logic; deep belief nets; field programmable gate array; large-scale architecture; machine learning domain; multiFPGA system; neuron connections; restricted Boltzmann machines; Bandwidth; Computational efficiency; Computer architecture; Hardware; Image processing; Large-scale systems; Logic; Machine learning; Neurons; Process design; Boltzmann machines; Computer architecture; Field programmable gate arrays; Large-scale systems; Neural network hardware; Parallel processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Field-Programmable Custom Computing Machines (FCCM), 2010 18th IEEE Annual International Symposium on
Conference_Location
Charlotte, NC
Print_ISBN
978-0-7695-4056-6
Electronic_ISBN
978-1-4244-7143-0
Type
conf
DOI
10.1109/FCCM.2010.38
Filename
5474047
Link To Document