DocumentCode :
1926733
Title :
Coordinating the use of GPU and CPU for improving performance of compute intensive applications
Author :
Teodoro, G. ; Sachetto, R. ; Sertel, O. ; Gurcan, M.N. ; Meira, W. ; Catalyurek, U. ; Ferreira, R.
Author_Institution :
Dept. of Comput. Sci., Univ. Fed. de Minas Gerais, Belo Horizonte, Brazil
fYear :
2009
fDate :
Aug. 31 2009-Sept. 4 2009
Firstpage :
1
Lastpage :
10
Abstract :
GPUs have recently evolved into very fast parallel co-processors capable of executing general purpose computations extremely efficiently. At the same time, multi-core CPUs evolution continued and today´s CPUs have 4-8 cores. These two trends, however, have followed independent paths in the sense that we are aware of very few works that consider both devices cooperating to solve general computations. In this paper we investigate the coordinated use of CPU and GPU to improve efficiency of applications even further than using either device independently. We use Anthill runtime environment, a data-flow oriented framework in which applications are decomposed into a set of event-driven filters, where for each event, the runtime system can use either GPU or CPU for its processing. For evaluation, we use a histopathology application that uses image analysis techniques to classify tumor images for neuroblas-toma prognosis. Our experimental environment includes dual and octa-core machines, augmented with GPUs and we evaluate our approach´s performance for standalone and distributed executions. Our experiments show that a pure GPU optimization of the application achieved a factor of 15 to 49 times improvement over the single core CPU version, depending on the versions of the CPUs and GPUs. We also show that the execution can be further reduced by a factor of about 2 by using our runtime system that effectively choreographs the execution to run cooperatively both on GPU and on a single core of CPU. We improve on that by adding more cores, all of which were previously neglected or used ineffectively. In addition, the evaluation on a distributed environment has shown near linear scalability to multiple hosts.
Keywords :
coprocessors; digital filters; medical image processing; microcomputers; tumours; Anthill runtime environment; compute intensive applications; distributed execution; dual-core machine; event-driven filters; graphics processing unit; histopathology application; image analysis techniques; multi-core CPUs; neuroblastoma prognosis; octa-core machine; parallel co-processors; standalone execution; Biomedical computing; Cancer; Computer applications; Concurrent computing; Image analysis; Microscopy; Neoplasms; Niobium; Performance analysis; Runtime environment;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing and Workshops, 2009. CLUSTER '09. IEEE International Conference on
Conference_Location :
New Orleans, LA
ISSN :
1552-5244
Print_ISBN :
978-1-4244-5011-4
Type :
conf
DOI :
10.1109/CLUSTR.2009.5289193
Filename :
5289193
Link To Document :
بازگشت