Title :
Towards GPU-Accelerated Large-Scale Graph Processing in the Cloud
Author :
Jianlong Zhong ; Bingsheng He
Author_Institution :
Nanyang Technol. Univ., Singapore, Singapore
Abstract :
Recently, we have witnessed that cloud providers start to offer heterogeneous computing environments. There have been wide interests in both clusters and cloud of adopting graphics processors (GPUs) as accelerators for various applications. On the other hand, large-scale graph processing is important for many data-intensive applications in the cloud. In this paper, we propose to leverage GPUs to accelerate large-scale graph processing in the cloud. Specifically, we develop an in-memory graph processing engine G2 with three non-trivial GPU-specific optimizations. Firstly, we adopt fine-grained APIs to take advantage of the massive thread parallelism of the GPU. Secondly, G2 embraces a graph partition based approach for load balancing on heterogeneous CPU/GPU architectures. Thirdly, a runtime system is developed to perform transparent memory management on the GPU, and to perform scheduling for an improved throughput of concurrent kernel executions from graph tasks. We have conducted experiments on an Amazon EC2 virtual cluster of eight nodes. Our preliminary results demonstrate that 1) GPU is a viable accelerator for cloud-based graph processing, and 2) the proposed optimizations improve the performance of GPU-based graph processing engine. We further present the lessons learnt and open problems towards large-scale graph processing with GPU accelerations.
Keywords :
cloud computing; graph theory; graphics processing units; operating system kernels; processor scheduling; resource allocation; software performance evaluation; storage management; virtual machines; Amazon EC2 virtual cluster; G2 in-memory graph processing engine; GPU-accelerated large-scale graph processing; GPU-based graph processing engine; cloud computing; cloud-based graph processing; concurrent kernel executions; data-intensive applications; graph partition based approach; graph tasks; graphics processors; heterogeneous CPU architectures; heterogeneous GPU architectures; heterogeneous computing environments; load balancing; massive thread parallelism; nontrivial GPU-specific optimizations; performance improvement; runtime system; throughput improvement; transparent memory management; Cloud computing; Engines; Graphics processing units; Kernel; Optimization; Parallel processing; Programming; GPGPU; GPU accelerations; Large-scale graph processing; cloud computing; graph partitioning;
Conference_Titel :
Cloud Computing Technology and Science (CloudCom), 2013 IEEE 5th International Conference on
Conference_Location :
Bristol
DOI :
10.1109/CloudCom.2013.8