Title :
Scalable Graph Exploration on Multicore Processors
Author :
Agarwal, Virat ; Petrini, Fabrizio ; Pasetto, Davide ; Bader, David A.
Author_Institution :
IBM TJ Watson, Yorktown Heights, NY, USA
Abstract :
Many important problems in computational sciences, social network analysis, security, and business analytics, are data-intensive and lend themselves to graph-theoretical analyses. In this paper we investigate the challenges involved in exploring very large graphs by designing a breadth-first search (BFS) algorithm for advanced multi-core processors that are likely to become the building blocks of future exascale systems. Our new methodology for large-scale graph analytics combines a highlevel algorithmic design that captures the machine-independent aspects, to guarantee portability with performance to future processors, with an implementation that embeds processorspecific optimizations. We present an experimental study that uses state-of-the-art Intel Nehalem EP and EX processors and up to 64 threads in a single system. Our performance on several benchmark problems representative of the power-law graphs found in real-world problems reaches processing rates that are competitive with supercomputing results in the recent literature. In the experimental evaluation we prove that our graph exploration algorithm running on a 4-socket Nehalem EX is (1) 2.4 times faster than a Cray XMT with 128 processors when exploring a random graph with 64 million vertices and 512 millions edges, (2) capable of processing 550 million edges per second with an R-MAT graph with 200 million vertices and 1 billion edges, comparable to the performance of a similar graph on a Cray MTA-2 with 40 processors and (3) 5 times faster than 256 BlueGene/L processors on a graph with average degree 50.
Keywords :
graph theory; multiprocessing systems; EX processors; Intel Nehalem EP; R-MAT graph; breadth-first search algorithm; business analytics; computational sciences; graph-theoretical analyses; highlevel algorithmic design; machine-independent aspects; multicore processors; scalable graph exploration; social network analysis; Algorithm design and analysis; Instruction sets; Multicore processing; Sockets; Synchronization;
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-7557-5
Electronic_ISBN :
978-1-4244-7558-2