DocumentCode :
1815931
Title :
Analysis of a GPU based CNN implementation
Author :
László, Endre ; Szolgay, Péter ; Nagy, Zoltán
Author_Institution :
Fac. of Inf. Technol., Pazmany Peter Catholic Univ., Budapest, Hungary
fYear :
2012
fDate :
29-31 Aug. 2012
Firstpage :
1
Lastpage :
5
Abstract :
The CNN (Cellular Neural Network) is a powerful image processing architecture whose hardware implementation is extremely fast. The lack of such hardware device in a development process can be substituted by using an efficient simulator implementation. Commercially available graphics cards with high computing capabilities make this simulator feasible. The aim of this work is to present a GPU based implementation of a CNN simulator using nVidia´s Fermi architecture. Different implementation approaches are considered and compared to a multi-core, multi-threaded CPU and some earlier GPU implementations. A detailed analysis of the introduced GPU implementation is presented.
Keywords :
cellular neural nets; graphics processing units; multiprocessing systems; parallel architectures; GPU based CNN implementation; cellular neural network; graphics cards; hardware implementation; image processing architecture; multicore multithreaded CPU; nVidia Fermi architecture; simulator implementation; Arrays; Equations; Graphics processing unit; Hardware; Instruction sets; Mathematical model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cellular Nanoscale Networks and Their Applications (CNNA), 2012 13th International Workshop on
Conference_Location :
Turin
ISSN :
2165-0160
Print_ISBN :
978-1-4673-0287-6
Type :
conf
DOI :
10.1109/CNNA.2012.6331451
Filename :
6331451
Link To Document :
بازگشت