Title :
Restricted Boltzmann Machines and Deep Belief Networks on multi-core processors
Author :
Lopes, Nelson ; Ribeiro, Bernardete ; Gonçalves, João
Author_Institution :
CISUC-Center for Inf. & Syst., Univ. of Coimbra, Coimbra, Portugal
Abstract :
Deep learning architectures, in contrast with shallow models, draw on insights from biological inspiration, a challenge pursued since the idea of simulating the brain was first conceived. In particular, their many hierarchical levels of composition motivate parallel implementations that can make training acceptably fast. When it comes to performance, Graphics Processing Units (GPUs) have carved out their own place in machine learning. In this paper, we present an approach that relies mainly on three kernels for implementing both the Restricted Boltzmann Machine (RBM) and Deep Belief Network (DBN) algorithms. Instead of considering the neuron as the smallest unit of computation, each thread represents the connection between two neurons (one visible and one hidden). Although this may seem counterintuitive at first, the rationale is to view a connection as performing a simple function that multiplies the clamped input by its weight. Thus, we maximize the GPU workload and avoid idle cores. Moreover, we placed great emphasis on designing the kernels to avoid uncoalesced memory accesses and to take advantage of shared memory in order to reduce global memory accesses. Additionally, our approach uses a step adaptive learning rate procedure that accelerates convergence. The approach yields very good speedups (up to 46×) over a straightforward implementation when both GPU and CPU versions are tested on the MNIST database.
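As a rough illustration of the thread-per-connection idea described in the abstract (a sketch written for this record, not the authors' actual kernels), the CUDA fragment below assigns one block per hidden unit and lets each thread multiply clamped visible values by the corresponding weights before a shared-memory reduction produces the unit's activation. The kernel and variable names, the 784-unit visible layer, and the block size are assumptions made here for illustration.

#include <cuda_runtime.h>
#include <math.h>

#define NUM_VISIBLE 784   // e.g. one MNIST image (assumption for illustration)
#define BLOCK_SIZE  128   // threads per block, power of two for the reduction

// One block per hidden unit j; each thread handles a subset of the
// (visible, hidden) connections of that unit, multiplying the clamped
// visible value by its weight. Partial sums are reduced in shared
// memory so global memory is read only once per connection.
__global__ void computeHiddenProbabilities(const float *v,       // clamped visible units
                                           const float *W,       // weights, row-major: W[j * NUM_VISIBLE + i]
                                           const float *bHidden, // hidden biases
                                           float *hProb)         // output: P(h_j = 1 | v)
{
    __shared__ float partial[BLOCK_SIZE];
    const int j   = blockIdx.x;      // hidden unit handled by this block
    const int tid = threadIdx.x;

    // Each thread accumulates the products of "its" connections;
    // consecutive threads read consecutive weights (coalesced access).
    float sum = 0.0f;
    for (int i = tid; i < NUM_VISIBLE; i += BLOCK_SIZE)
        sum += v[i] * W[j * NUM_VISIBLE + i];
    partial[tid] = sum;
    __syncthreads();

    // Tree reduction in shared memory to obtain the pre-activation.
    for (int s = BLOCK_SIZE / 2; s > 0; s >>= 1) {
        if (tid < s)
            partial[tid] += partial[tid + s];
        __syncthreads();
    }

    // Thread 0 applies the logistic sigmoid to bias + weighted sum.
    if (tid == 0)
        hProb[j] = 1.0f / (1.0f + expf(-(partial[0] + bHidden[j])));
}

A host launch would then use one block per hidden unit, e.g. computeHiddenProbabilities<<<numHidden, BLOCK_SIZE>>>(d_v, d_W, d_bHidden, d_hProb); the same pattern can be mirrored for the visible-layer reconstruction step of contrastive divergence.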
Keywords :
Boltzmann machines; belief networks; graphics processing units; learning (artificial intelligence); multiprocessing systems; CPU implementations; DBN; GPU workload maximization; MNIST database; RBM; biological inspiration; deep belief networks; deep learning architecture models; graphics processing units; machine learning; multicore processors; restricted Boltzmann machines; shallow models; step adaptive learning rate procedure; Computer architecture; Graphics processing unit; Instruction sets; Kernel; Neurons; Training; Vectors;
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
DOI :
10.1109/IJCNN.2012.6252431