DocumentCode
576924
Title
A GPU Implementation of Generalized Graph Processing Algorithm GIM-V
Author
Shirahata, Koichi ; Sato, Hitoshi ; Suzumura, Toyotaro ; Matsuoka, Satoshi
Author_Institution
Tokyo Inst. of Technol., Tokyo, Japan
fYear
2012
fDate
24-28 Sept. 2012
Firstpage
207
Lastpage
212
Abstract
Fast processing for extremely large-scale graph, which consists of millions to trillions of vertices and 100 billions to 100 trillions of edges, is becoming increasingly important in various domains such as health care, social networks, intelligence, system biology, and electric power grid, etc. The GIM-V algorithm based on MapReduce programing model is designed as general graph processing method for supporting petabyte-scale graph data. On the other hand, recent large-scale data-intensive computing systems tend to employ GPU accelerators to gain good peak performance and high memory bandwidth, however, the validity of acceleration, including optimization techniques, of the GIM-V algorithm using GPUs is an open problem. To address the problem, we implemented a GPU-based GIM-V application. We conducted our implementation using single node (12 hyper-threaded CPU cores, 1 GPU). The results showed that our GPU-based implementation performed 8.80 to 39.0 times faster than the original Hadoop-based GIM-V implementation (PEGASUS), and 2.72 times faster in the map stage than the CPU-based naive implementation. We also observed that the total elapsed time of our implementation introduces significant load imbalance between threads in a GPU, which causes 1.52 times performance degradation than the CPU-based implementation.
Keywords
graph theory; graphics processing units; iterative methods; mathematics computing; matrix multiplication; parallel processing; vectors; CPU-based naive implementation; GPU accelerators; GPU implementation; Hadoop-based GIM-V implementation; MapReduce programing model; PEGASUS; generalized graph processing algorithm GIM-V; generalized iterative matrix-vector multiplication algorithm; large-scale data-intensive computing systems; memory bandwidth; peak performance; petabyte-scale graph data; significant load imbalance; Acceleration; Convergence; Graphics processing units; Instruction sets; Libraries; Mars; Vectors; GPGPU; Large-scale Graph Processing; MapReduce;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing Workshops (CLUSTER WORKSHOPS), 2012 IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4673-2893-7
Type
conf
DOI
10.1109/ClusterW.2012.34
Filename
6355866
Link To Document