DocumentCode :
3222491
Title :
Fast PGAS Implementation of Distributed Graph Algorithms
Author :
Cong, Guojing ; Almasi, George ; Saraswat, Vijay
Author_Institution :
IBM TJ Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2010
fDate :
13-19 Nov. 2010
Firstpage :
1
Lastpage :
11
Abstract :
Due to the memory intensive workload and the erratic access pattern, irregular graph algorithms are notoriously hard to implement and optimize for high performance on distributed-memory systems. Although the PGAS paradigm proposed recently improves ease of programming, no high performance PGAS implementation of large-scale graph analysis is known. We present the first fast PGAS implementation of graph algorithms for the connected components and minimum spanning tree problems. By improving memory access locality, compared with the naive implementation, our implementation exhibits much better communication efficiency and cache performance on a cluster of SMPs. With additional algorithmic and PGASspecific optimizations, our implementation achieves significant speedups over both the best sequential implementation and the best single-node SMP implementation for large, sparse graphs with more than a billion edges.
Keywords :
distributed memory systems; parallel algorithms; trees (mathematics); SMP cluster; distributed graph algorithms; distributed-memory systems; erratic access pattern; fast PGAS; irregular graph algorithms; large-scale graph analysis; minimum spanning tree problems; parallel graph algorithms; Algorithm design and analysis; Clustering algorithms; Electronics packaging; Instruction sets; Optimization; Phase change random access memory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-7557-5
Electronic_ISBN :
978-1-4244-7558-2
Type :
conf
DOI :
10.1109/SC.2010.26
Filename :
5644844
Link To Document :
بازگشت