Title :
Adapting Irregular Computations to Large CPU-GPU Clusters in the MADNESS Framework
Author :
Slavici, Vlad ; Varier, Raghu ; Cooperman, Gene ; Harrison, Robert J.
Author_Institution :
Northeastern Univ., Boston, MA, USA
Abstract :
Graphics Processing Units (GPUs) are becoming the workhorse of scalable computations. MADNESS is a scientific framework used especially for computational chemistry. Most MADNESS applications use operators that involve many small tensor computations, resulting in a less regular organization of computations on GPUs. A single GPU kernel may have to multiply by hundreds of small square matrices (with fixed dimension ranging from 10 to 28). We demonstrate a scalable CPU-GPU implementation of the MADNESS framework over a 500-node partition on the Titan supercomputer. For this hybrid CPU-GPU implementation, we observe up to a 2.3-times speedup compared to an equivalent CPU-only implementation with 16 cores per node. For smaller matrices, we demonstrate a speedup of 2.2-times by using a custom CUDA kernel rather than a cuBLAS-based kernel.
Keywords :
graphics processing units; matrix algebra; parallel architectures; parallel machines; statistical analysis; tensors; CPU; GPU cluster; MADNESS framework; Titan supercomputer; computational chemistry; custom CUDA kernel; graphic processing unit; scalable computation; square matrix; tensor computation; Accuracy; Computational modeling; Graphics processing unit; Instruction sets; Kernel; Libraries; Tensile stress; GPU; heterogeneous CPU-GPU; hybrid CPU-GPU; irregular computation; supercomputing;
Conference_Titel :
Cluster Computing (CLUSTER), 2012 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2422-9
DOI :
10.1109/CLUSTER.2012.42