• DocumentCode
    576924
  • Title

    A GPU Implementation of Generalized Graph Processing Algorithm GIM-V

  • Author

    Shirahata, Koichi ; Sato, Hitoshi ; Suzumura, Toyotaro ; Matsuoka, Satoshi

  • Author_Institution
    Tokyo Inst. of Technol., Tokyo, Japan
  • fYear
    2012
  • fDate
    24-28 Sept. 2012
  • Firstpage
    207
  • Lastpage
    212
  • Abstract
    Fast processing for extremely large-scale graph, which consists of millions to trillions of vertices and 100 billions to 100 trillions of edges, is becoming increasingly important in various domains such as health care, social networks, intelligence, system biology, and electric power grid, etc. The GIM-V algorithm based on MapReduce programing model is designed as general graph processing method for supporting petabyte-scale graph data. On the other hand, recent large-scale data-intensive computing systems tend to employ GPU accelerators to gain good peak performance and high memory bandwidth, however, the validity of acceleration, including optimization techniques, of the GIM-V algorithm using GPUs is an open problem. To address the problem, we implemented a GPU-based GIM-V application. We conducted our implementation using single node (12 hyper-threaded CPU cores, 1 GPU). The results showed that our GPU-based implementation performed 8.80 to 39.0 times faster than the original Hadoop-based GIM-V implementation (PEGASUS), and 2.72 times faster in the map stage than the CPU-based naive implementation. We also observed that the total elapsed time of our implementation introduces significant load imbalance between threads in a GPU, which causes 1.52 times performance degradation than the CPU-based implementation.
  • Keywords
    graph theory; graphics processing units; iterative methods; mathematics computing; matrix multiplication; parallel processing; vectors; CPU-based naive implementation; GPU accelerators; GPU implementation; Hadoop-based GIM-V implementation; MapReduce programing model; PEGASUS; generalized graph processing algorithm GIM-V; generalized iterative matrix-vector multiplication algorithm; large-scale data-intensive computing systems; memory bandwidth; peak performance; petabyte-scale graph data; significant load imbalance; Acceleration; Convergence; Graphics processing units; Instruction sets; Libraries; Mars; Vectors; GPGPU; Large-scale Graph Processing; MapReduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing Workshops (CLUSTER WORKSHOPS), 2012 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4673-2893-7
  • Type

    conf

  • DOI
    10.1109/ClusterW.2012.34
  • Filename
    6355866