Title :
A case study of streaming storage format for sparse matrices
Author :
Jain-Mendon, Shweta ; Sass, Ron
Author_Institution :
Reconfigurable Comput. Syst. Lab., Univ. of North Carolina at Charlotte, Charlotte, NC, USA
Abstract :
The Field-Programmable Gate Array is an excellent match for the sparse matrix-vector multiply operation because of its enormous computational capacity and its ability to build a custom memory hierarchy that matches the memory access patterns of the operation. This paper describes a streaming sparse matrix format and a custom memory subsystem that decodes it on-the-fly for multiple compute units. Results show that the proposed design is very efficient and allows the sparse matrix to be streamed sequentially from off-chip RAM. On a Xilinx Virtex-4 device, this results in an average of 58% memory-bandwidth efficiency for one compute unit and 70% memory-bandwidth efficiency for two compute units. The design is capable of achieving an average of 47% of theoretical peak floating-point performance for each compute unit.
Keywords :
computational complexity; field programmable gate arrays; random-access storage; sparse matrices; Xilinx Virtex-4 device; case study; custom memory hierarchy; custom memory subsystem; field programmable gate array; memory access patterns; memory-bandwidth efficiency; off-chip RAM; sparse matrices; sparse matrix vector; streaming storage format; Bandwidth; Decoding; Finite element methods; Indexes; Performance evaluation; Sparse matrices; Vectors; Sparse Matrix; Storage Formats;
Conference_Titel :
Reconfigurable Computing and FPGAs (ReConFig), 2012 International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4673-2919-4
DOI :
10.1109/ReConFig.2012.6416788