Title :
Addressing Memory Wall Problem of Graph Computation in Reconfigurable System
Author :
Xu Wang;Linan Huang;Yongxin Zhu;Yipeng Zhou;Huwan Peng;Haifei Xiong
Author_Institution :
Sch. of Microelectron., Shanghai Jiao Tong Univ., Shanghai, China
Abstract :
Graph computation problems that exhibit irregular memory access patterns are known to show poor performance on multiprocessor architectures. Although recent studies use FPGA technology to tackle the memory wall problem of graph computation by adopting a massively multi-threaded architecture, the performance is still far less than optimal memory performance due to the long memory access latency. In this paper, we address the memory wall problem by taking advantage of sequential streaming bandwidth of external DRAM memory. First, we present an edge-streaming model that streams edges from external DRAM memory while makes random access to the set of vertices in on-chip SRAM, leading to a fully utilization of external memory bandwidth in burst mode. Second, we propose an on-chip distributed off-chip shared memory architecture with a high performance shuffle network to real-timely shuffle intermediate results, which significantly reduces the requirement for intermediate buffers and saves off-chip memory bandwidth. We further use PageRank as a case study to validate the effectiveness of the proposed architecture. Evaluation results on ML605 board show that our architecture can achieve up to 4× improvement in terms of performance to bandwidth ratio over previously published FPGA-based implementations.
Keywords :
"Random access memory","System-on-chip","Bandwidth","Computational modeling","Memory architecture","Engines"
Conference_Titel :
High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS), 2015 IEEE 12th International Conferen on Embedded Software and Systems (ICESS), 2015 IEEE 17th International Conference on
DOI :
10.1109/HPCC-CSS-ICESS.2015.77