DocumentCode :
2786036
Title :
GraphStep: A System Architecture for Sparse-Graph Algorithms
Author :
DeLorimier, Michael ; Kapre, Nachiket ; Mehta, Nikil ; Rizzo, Dominic ; Eslick, Ian ; Rubin, Raphael ; Uribe, Tomas E. ; Knight, Thomas F., Jr. ; DeHon, Andre
Author_Institution :
Dept. of Comput. Sci., California Inst. of Technol., Pasadena, CA
fYear :
2006
fDate :
24-26 April 2006
Firstpage :
143
Lastpage :
151
Abstract :
Many important applications are organized around long-lived, irregular sparse graphs (e.g., data and knowledge bases, CAD optimization, numerical problems, simulations). The graph structures are large, and the applications need regular access to a large, data-dependent portion of the graph for each operation (e.g., the algorithm may need to walk the graph, visiting all nodes, or propagate changes through many nodes in the graph). On conventional microprocessors, the graph structures exceed on-chip cache capacities, making main-memory bandwidth and latency the key performance limiters. To avoid this "memory wall," we introduce a concurrent system architecture for sparse graph algorithms that places graph nodes in small distributed memories paired with specialized graph processing nodes interconnected by a lightweight network. This gives us a scalable way to map these applications so that they can exploit the high-bandwidth and low-latency capabilities of embedded memories (e.g., FPGA Block RAMs). On typical spreading-activation queries on the ConceptNet Knowledge Base, a sample application, this translates into an order of magnitude speedup per FPGA compared to a state-of-the-art Pentium processor
Keywords :
field programmable gate arrays; graph theory; microcomputers; sparse matrices; system-on-chip; FPGA; GraphStep; distributed memories; embedded memories; graph nodes; graph structures; main-memory bandwidth; microprocessors; on-chip cache capacities; sparse graph algorithms; system architecture; Aggregates; Bandwidth; Computer architecture; Delay; Field programmable gate arrays; Hardware; Microprocessors; Numerical simulation; Random access memory; SDRAM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field-Programmable Custom Computing Machines, 2006. FCCM '06. 14th Annual IEEE Symposium on
Conference_Location :
Napa, CA
Print_ISBN :
0-7695-2661-6
Type :
conf
DOI :
10.1109/FCCM.2006.45
Filename :
4020903
Link To Document :
بازگشت