Title :
Analysis of a GPU based CNN implementation
Author :
László, Endre ; Szolgay, Péter ; Nagy, Zoltán
Author_Institution :
Fac. of Inf. Technol., Pazmany Peter Catholic Univ., Budapest, Hungary
Abstract :
The CNN (Cellular Neural Network) is a powerful image processing architecture whose hardware implementation is extremely fast. The lack of such hardware device in a development process can be substituted by using an efficient simulator implementation. Commercially available graphics cards with high computing capabilities make this simulator feasible. The aim of this work is to present a GPU based implementation of a CNN simulator using nVidia´s Fermi architecture. Different implementation approaches are considered and compared to a multi-core, multi-threaded CPU and some earlier GPU implementations. A detailed analysis of the introduced GPU implementation is presented.
Keywords :
cellular neural nets; graphics processing units; multiprocessing systems; parallel architectures; GPU based CNN implementation; cellular neural network; graphics cards; hardware implementation; image processing architecture; multicore multithreaded CPU; nVidia Fermi architecture; simulator implementation; Arrays; Equations; Graphics processing unit; Hardware; Instruction sets; Mathematical model;
Conference_Titel :
Cellular Nanoscale Networks and Their Applications (CNNA), 2012 13th International Workshop on
Conference_Location :
Turin
Print_ISBN :
978-1-4673-0287-6
DOI :
10.1109/CNNA.2012.6331451