DocumentCode :
1983077
Title :
A Reconfigurable Computing Approach for Efficient and Scalable Parallel Graph Exploration
Author :
Betkaoui, Brahim ; Wang, Yu ; Thomas, David B. ; Luk, Wayne
Author_Institution :
Dept. of Comput., Imperial Coll. London, London, UK
fYear :
2012
fDate :
9-11 July 2012
Firstpage :
8
Lastpage :
15
Abstract :
In many application domains, data are represented using large graphs involving millions of vertices and billions of edges. Graph exploration algorithms, such as breadth-first search (BFS), are largely dominated by memory latency and are challenging to process efficiently. In this paper, we present a reconfigurable hardware methodology for efficient parallel processing of large-scale graph exploration problems. Our methodology is based on a reconfigurable hardware architecture which decouples computation and communication while keeping multiple memory requests in flight at any given time, taking advantage of the hardware capabilities of both FPGAs and the parallel memory subsystem. To validate our methodology, we provide a detailed design description of the Breadth-First Search algorithm on an FPGA-based high performance computing system. Using graph data based on the power-law graphs found in real-word problems, we are able to achieve performance results that are superior to those of high performance multi-core systems in the recent literature for large graph instances, and a throughput in excess of 2.5 billion traversed edges per second on RMAT graphs with 16 million vertices and over a billion edges. Using four Virtex-5 LX330 FPGAs based on 65nm technology and running at 75MHz, our BFS design achieves more than twice the speed of a 32-core Xeon X7560 based on 45nm technology and running at 2.26GHz.
Keywords :
field programmable gate arrays; graph theory; multiprocessing systems; parallel processing; tree searching; BFS; FPGA; breadth first search; memory latency; multi-core systems; parallel memory subsystem; power law graphs; reconfigurable computing approach; reconfigurable hardware architecture; scalable parallel graph exploration; Algorithm design and analysis; Arrays; Field programmable gate arrays; Hardware; Kernel; Random access memory; Breadth-First Search (BFS); FPGA; Parallel graph algorithms; hardware acceleration; high performance reconfigurable computing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Application-Specific Systems, Architectures and Processors (ASAP), 2012 IEEE 23rd International Conference on
Conference_Location :
Delft
ISSN :
2160-0511
Print_ISBN :
978-1-4673-2243-0
Electronic_ISBN :
2160-0511
Type :
conf
DOI :
10.1109/ASAP.2012.30
Filename :
6341448
Link To Document :
بازگشت