DocumentCode :
2050513
Title :
Performance Portability of a GPU Enabled Factorization with the DAGuE Framework
Author :
Bosilca, George ; Bouteiller, Aurelien ; Herault, Thomas ; Lemarinier, Pierre ; Saengpatsa, Narapat Ohm ; Tomov, Stanimire ; Dongarra, Jack J.
fYear :
2011
fDate :
26-30 Sept. 2011
Firstpage :
395
Lastpage :
402
Abstract :
Performance portability is a major challenge faced today by developers on heterogeneous high performance computers, consisting of an interconnect, memory with non-uniform access, many-cores and accelerators like GPUs. Recent studies have successfully demonstrated that dense linear algebra operations can be efficiently handled by runtime systems using a DAG representation. In this work, we present the GPU subsystem of the DAGuE runtime, and assess, on the Cholesky factorization test case, the minimal efforts required by a programmer to enable GPU acceleration in the DAGuE framework. The performance achieved by this unchanged code, on a variety of heterogeneous and distributed many cores and GPU resources, demonstrates the desired performance portability.
Keywords :
coprocessors; multiprocessing systems; performance evaluation; Cholesky factorization test; DAG representation; DAGuE framework; DAGuE runtime; GPU acceleration; GPU enabled factorization; GPU resources; GPU subsystem; accelerators; dense linear algebra operation; heterogeneous high performance computers; many-cores; performance portability; Central Processing Unit; Graphics processing unit; Hardware; Kernel; Processor scheduling; Runtime; Tiles; DAG scheduling; GPU; cluster; linear algebra;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cluster Computing (CLUSTER), 2011 IEEE International Conference on
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4577-1355-2
Electronic_ISBN :
978-0-7695-4516-5
Type :
conf
DOI :
10.1109/CLUSTER.2011.51
Filename :
6061070