DocumentCode :
560149
Title :
Parallel breadth-first search on distributed memory systems
Author :
Buluç, Aydin ; Madduri, Kamesh
Author_Institution :
Comput. Res. Div., Lawrence Berkeley Nat. Lab., Berkeley, CA, USA
fYear :
2011
fDate :
12-18 Nov. 2011
Firstpage :
1
Lastpage :
12
Abstract :
Data-intensive, graph-based computations are pervasive in several scientific applications, and are known to be quite challenging to implement on distributed memory systems. In this work, we explore the design space of parallel algorithms for Breadth-First Search (BFS), a key subroutine in several graph algorithms. We present two highly tuned parallel approaches for BFS on large parallel systems: a level-synchronous strategy that relies on a simple vertex-based partitioning of the graph, and a two-dimensional sparse matrix partitioning-based approach that mitigates parallel communication overhead. For both approaches, we also present hybrid versions with intra-node multithreading. Our novel hybrid two-dimensional algorithm reduces communication times by up to a factor of 3.5, relative to a common vertex-based approach. Our experimental study identifies execution regimes in which these approaches will be competitive, and we demonstrate extremely high performance on leading distributed-memory parallel systems. For instance, for a 40,000-core parallel execution on Hopper, an AMD Magny-Cours based system, we achieve a BFS performance rate of 17.8 billion edge visits per second on an undirected graph of 4.3 billion vertices and 68.7 billion edges with skewed degree distribution.
Keywords :
distributed memory systems; graph theory; multi-threading; parallel algorithms; sparse matrices; tree searching; 2D sparse matrix partitioning; AMD Magny-Cours based system; BFS performance; Hopper; data-intensive graph-based computation; design space; distributed-memory parallel system; graph algorithm; intra-node multithreading; level-synchronous strategy; parallel algorithm; parallel breadth-first search; parallel communication overhead; skewed degree distribution; undirected graph; vertex-based graph partitioning; Algorithm design and analysis; Message systems; Optimization; Partitioning algorithms; Program processors; Sparse matrices; Vectors;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2011 International Conference for
Conference_Location :
Seattle, WA
Electronic_ISBN :
978-1-4503-0771-0
Type :
conf
Filename :
6114414