مرکز منطقه ای اطلاع رساني علوم و فناوري - A Scalable Implementation of a MapReduce-based Graph Processing Algorithm for Large-Scale Heterogeneous Supercomputers

DocumentCode :

611051

Title :

A Scalable Implementation of a MapReduce-based Graph Processing Algorithm for Large-Scale Heterogeneous Supercomputers

Author :

Shirahata, Koichi ; Sato, Hikaru ; Suzumura, Toyotaro ; Matsuoka, Shingo

Author_Institution :

Dept. of Math. & Comput. Sci., Tokyo Inst. of Technol., Tokyo, Japan

fYear :

2013

fDate :

13-16 May 2013

Firstpage :

277

Lastpage :

284

Abstract :

Fast processing for extremely large-scale graph is becoming increasingly important in various domains such as health care, social networks, intelligence, system biology, and electric power grids. The GIM-V algorithm based on MapReduce programing model is designed as a general graph processing method for supporting petabyte-scale graph data. On the other hand, recent large-scale data-intensive computing systems tend to employ GPU accelerators to gain good peak performance and high memory bandwidth, however, the validity of acceleration, including optimization techniques, of the GIM-V algorithm using GPUs is an open problem. To address the problem, we implemented a multi-GPU-based GIM-V application with load balance optimization between GPU devices. Our implementation extends the existing MapReduce library for supporting multi-GPU-environments using the MPI library and optimizes load balance between GPU devices by employing task scheduling-based graph partitioning. We conducted our implementation on the TSUBAME2.0 supercomputer using 256 nodes (6144 hyper-threaded CPU cores, 768 GPUs). The results exhibit that our GPU-based implementation performed 87.04 ME/s on 2³⁰ (1.07 billion) vertices and 2³⁴ (17.2 billion) edges, and 1.52 times faster than the CPU-based naive implementation with 2^29 vertices and 2³³ edges. We also studied the performance characteristics of our implementation and load balance optimization technique.

Keywords :

application program interfaces; computer graphics; graphics processing units; message passing; parallel programming; resource allocation; GIM-V algorithm; GPU accelerator; MPI library; MapReduce programing model; MapReduce-based graph processing algorithm; TSUBAME2.0 supercomputer; data-intensive computing system; electric power grid; graphics processing unit; health care; large-scale heterogeneous supercomputer; load balance optimization; memory bandwidth; message passing interface; petabyte-scale graph data support; social network; system biology; Graphics processing units; Instruction sets; Libraries; Mars; Optimization; Performance evaluation; Supercomputers; GPGPU; Large-scale Graph Processing; MapReduce;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Cluster, Cloud and Grid Computing (CCGrid), 2013 13th IEEE/ACM International Symposium on

Conference_Location :

Delft

Print_ISBN :

978-1-4673-6465-2

Type :

conf

DOI :

10.1109/CCGrid.2013.85

Filename :

6546103

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=611051