DocumentCode :
3382952
Title :
CNN based high performance computing for real time image processing on GPU
Author :
Potluri, Sreeram ; Fasih, Alireza ; Vutukuru, Laxminand Kishore ; Al Machot, Fadi ; Kyamakya, Kyandoghere
Author_Institution :
Transp. Inf. Group, Alpen-Adria Univ. of Klagenfurt, Klagenfurt, Austria
fYear :
2011
fDate :
25-27 July 2011
Firstpage :
1
Lastpage :
7
Abstract :
Many of the basic image processing tasks suffer from processing overhead to operate over the whole image. In real time applications the processing time is considered as a big obstacle for its implementations. A High Performance Computing (HPC) platform is necessary in order to solve this problem. The usage of hardware accelerator make the processing time low. In recent developments, the Graphics Processing Unit (GPU) is being used in many applications. Along with the hardware accelerator a proper choice of the computing algorithm makes it an added advantage for fast processing of images. The Cellular Neural Network (CNN) is a large-scale nonlinear analog circuit able to process signals in real time [1]. In this paper, we develop a new design in evaluation of image processing algorithms on the massively parallel GPUs with CNN implementation using Open Computing Language (OpenCL) programming model. This implementation uses the Discrete Time CNN (DT-CNN) model which is derived from originally proposed CNN model. The inherent massive parallelism of CNN along with GPUs makes it an advantage for high performance computing platform [2]. The advantage of OpenCL makes the design to be portable on all the available graphics processing devices and multi core processors. Performance evaluation is done in terms of execution time with both device (i.e. GPU) and host (i.e. CPU).
Keywords :
analogue circuits; cellular neural nets; computer graphic equipment; coprocessors; image processing; multiprocessing systems; parallel programming; CNN based high performance computing; DT-CNN model; OpenCL; cellular neural network; discrete time CNN model; graphical processing unit; hardware accelerator; large-scale nonlinear analog circuit; multicore processor; open computing language programming model; parallel GPU; performance evaluation; real time image processing; signal processing; Computer architecture; Equations; Graphics processing unit; Image processing; Kernel; Mathematical model; Cellular Neural Networks; GPUs; Hardware accelerators; High Performance Computing; Image processing; OpenCL;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Nonlinear Dynamics and Synchronization (INDS) & 16th Int'l Symposium on Theoretical Electrical Engineering (ISTET), 2011 Joint 3rd Int'l Workshop on
Conference_Location :
Klagenfurt
Print_ISBN :
978-1-4577-0759-9
Type :
conf
DOI :
10.1109/INDS.2011.6024781
Filename :
6024781
Link To Document :
بازگشت