• DocumentCode
    2205238
  • Title

    Achieving up to zero communication delay in BSP-based graph processing via vertex categorization

  • Author

    Xuhong Zhang ; Ruijun Wang ; Xunchao Chen ; Jun Wang ; Lukasiewicz, Tyler ; Dezhi Han

  • Author_Institution
    Department of Electrical Engineering & Computer Science, University of Central Florida, Orlando, 32826, USA
  • fYear
    2015
  • fDate
    6-7 Aug. 2015
  • Firstpage
    112
  • Lastpage
    121
  • Abstract
    The Bulk Synchronous Parallel (BSP) model, which divides a graphing algorithm into multiple supersteps, has become extremely popular in distributed graph processing systems. However, the high number of network messages exchanged in each superstep of the graph algorithm will create a long period of time. We refer to this as a communication delay. Furthermore, the BSP´s global synchronization barrier does not allow computation in the next superstrep to be scheduled during this communication delay. This communication delay makes up a large percentage of the overall processing time of a superstep. While most recent research has focused on reducing number of network messages, but communication delay is still a deterministic factor for overall performance. In this paper, we add a runtime communication and computation scheduler into current graph BSP implementations. This scheduler will move some computation from the next superstep to the communication phase in the current superstep to mitigate the communication delay. Finally, we prototyped our system, Zebra, on Apache Hama, which is an open source clone of the classic Google Pregel. By running a set of graph algorithms on an in-house cluster, our evaluation shows that our system could completely eliminate the communication delay in the best case and can achieve average 2X speedup over Hama.
  • Keywords
    Clustering algorithms; Computational modeling; Delays; Message systems; Partitioning algorithms; Roads; Synchronization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networking, Architecture and Storage (NAS), 2015 IEEE International Conference on
  • Conference_Location
    Boston, MA, USA
  • Type

    conf

  • DOI
    10.1109/NAS.2015.7255213
  • Filename
    7255213