Title :
Hybrid Parallel Programming on GPU Clusters
Author :
Yang, Chao-Tung ; Huang, Chih-Lin ; Lin, Cheng-Fang ; Chang, Tzu-Chieh
Author_Institution :
Dept. of Comput. Sci., Tunghai Univ., Taichung, Taiwan
Abstract :
NVIDIA's CUDA is a general-purpose, scalable parallel programming model for writing highly parallel applications. It provides several key abstractions: a hierarchy of thread blocks, shared memory, and barrier synchronization. This model has proven quite successful at programming multithreaded many-core GPUs and scales transparently to hundreds of cores; scientists throughout industry and academia are already using CUDA to achieve dramatic speedups on production and research codes. In this paper, we propose a hybrid parallel programming approach that combines CUDA and MPI and partitions loop iterations according to the number of C1060 GPU nodes in a GPU cluster consisting of one C1060 and one S1070. Loop iterations assigned to one MPI process are then processed in parallel by CUDA on the processor cores of the same computational node.
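The partitioning step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which partitioning scheme is used, so the contiguous block split below is an assumption, and the names `partition_iterations` and `process_chunk` are hypothetical. No MPI or CUDA library is called: the loop over ranks stands in for MPI processes, and `process_chunk` stands in for the CUDA kernel each node would launch on its share of the iterations.

```python
def partition_iterations(n_iters, n_procs, rank):
    """Return the half-open range [start, end) of loop iterations for `rank`.

    Assumed scheme: contiguous blocks, with any remainder spread over
    the lowest-numbered ranks so chunk sizes differ by at most one.
    """
    base, extra = divmod(n_iters, n_procs)
    # The first `extra` ranks take one additional iteration each.
    start = rank * base + min(rank, extra)
    end = start + base + (1 if rank < extra else 0)
    return start, end


def process_chunk(data, start, end):
    """Stand-in for the per-node CUDA kernel: square each element of the chunk."""
    return [x * x for x in data[start:end]]


if __name__ == "__main__":
    data = list(range(10))
    n_procs = 3  # hypothetical number of MPI processes (one per GPU node)
    result = []
    # In a real hybrid run, each rank would execute its chunk concurrently.
    for rank in range(n_procs):
        start, end = partition_iterations(len(data), n_procs, rank)
        result.extend(process_chunk(data, start, end))
    print(result)
```

In an actual hybrid program, each MPI process would compute only its own range from its rank and the process count, copy that slice to its GPU, and launch a kernel over it; the results would then be gathered on one process.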
Keywords :
computer graphic equipment; coprocessors; message passing; parallel programming; C1060 GPU cluster; NVIDIA CUDA programming model; S1070 GPU cluster; graphics processing units; hybrid parallel programming; loop iterations; message passing interface; multithreaded programming; Computational modeling; Graphics processing unit; Instruction sets; Linux; Parallel processing; CUDA; GPU; MPI; OpenMP; hybrid
Conference_Title :
Parallel and Distributed Processing with Applications (ISPA), 2010 International Symposium on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-8095-1
Electronic_ISBN :
978-0-7695-4190-7
DOI :
10.1109/ISPA.2010.97