DocumentCode
3738245
Title
Optimizing memory performance for FPGA implementation of pagerank
Author
Shijie Zhou;Charalampos Chelmis;Viktor K. Prasanna
Author_Institution
Ming Hsieh Dept. of Electrical Engineering, University of Southern California, Los Angeles, CA, USA
fYear
2015
Firstpage
1
Lastpage
6
Abstract
Recently, FPGA implementation of graph algorithms arising in many areas such as social networks has been studied. However, the irregular memory access pattern of graph algorithms makes obtaining high performance challenging. In this paper, we present an FPGA implementation of the classic PageRank algorithm. Our goal is to optimize the overall system performance, especially the cost of accessing the off-chip DRAM. We optimize the data layout so that most of memory accesses to the DRAM are sequential. Post-place-and-route results show that our design on a state-of-the-art FPGA can achieve a high clock rate of over 200 MHz. Based on a realistic DRAM access model, we build a simulator to estimate the execution time including memory access overheads. The simulation results show that our design achieves at least 96% of the theoretically best performance of the target platform. Compared with a baseline design, our optimized design dramatically reduces the number of random memory accesses and improves the execution time by at least 70%.
Keywords
"Random access memory","Field programmable gate arrays","Layout","Computational modeling","System-on-chip","Pipelines","Computer architecture"
Publisher
ieee
Conference_Titel
ReConFigurable Computing and FPGAs (ReConFig), 2015 International Conference on
Type
conf
DOI
10.1109/ReConFig.2015.7393332
Filename
7393332
Link To Document