Title :
Performance and Scalability of GPU-Based Convolutional Neural Networks
Author :
Strigl, Daniel ; Kofler, Klaus ; Podlipnig, Stefan
Author_Institution :
Distrib. & Parallel Syst. Group, Univ. of Innsbruck, Innsbruck, Austria
Abstract :
In this paper we present the implementation of a framework for accelerating training and classification of arbitrary Convolutional Neural Networks (CNNs) on the GPU. CNNs are a derivative of standard Multilayer Perceptron (MLP) neural networks optimized for two-dimensional pattern recognition problems such as Optical Character Recognition (OCR) or face detection. We describe the basic parts of a CNN and demonstrate the performance and scalability improvement that can be achieved by shifting the computation-intensive tasks of a CNN to the GPU. Depending on the network topology training and classification on the GPU performs 2 to 24 times faster than on the CPU. Furthermore, the GPU version scales much better than the CPU implementation with respect to the network size.
Keywords :
computer graphic equipment; coprocessors; learning (artificial intelligence); multilayer perceptrons; GPU-based convolutional neural networks; face detection; multilayer perceptron neural networks; network topology classification; network topology training; optical character recognition; two-dimensional pattern recognition problems; Acceleration; Cellular neural networks; Multi-layer neural network; Multilayer perceptrons; Neural networks; Optical character recognition software; Optical computing; Optical fiber networks; Pattern recognition; Scalability; CUDA; GPGPU; convolutional neural networks; machine learning; performance; scalability;
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2010 18th Euromicro International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-1-4244-5672-7
Electronic_ISBN :
1066-6192
DOI :
10.1109/PDP.2010.43