DocumentCode :
2606333
Title :
Towards a Universal FPGA Matrix-Vector Multiplication Architecture
Author :
Kestur, Srinidhi ; Davis, John D. ; Chung, Eric S.
fYear :
2012
fDate :
April 29 2012-May 1 2012
Firstpage :
9
Lastpage :
16
Abstract :
We present the design and implementation of a universal, single-bit stream library for accelerating matrix-vector multiplication using FPGAs. Our library handles multiple matrix encodings ranging from dense to multiple sparse formats. A key novelty in our approach is the introduction of a hardware-optimized sparse matrix representation called Compressed Variable-Length Bit Vector (CVBV), which reduces the storage and bandwidth requirements up to 43% (on average 25%) compared to compressed sparse row (CSR) across all the matrices from the University of Florida Sparse Matrix Collection. Our hardware incorporates a runtime-programmable decoder that performs on-the-fly-decoding of various formats such as Dense, COO, CSR, DIA, and ELL. The flexibility and scalability of our design is demonstrated across two FPGA platforms: (1) the BEE3 (Virtex-5 LX155T with 16GB of DRAM) and (2) ML605 (Virtex-6 LX240T with 2GB of DRAM). For dense matrices, our approach scales to large data sets with over 1 billion elements, and achieves robust performance independent of the matrix aspect ratio. For sparse matrices, our approach using a compressed representation reduces the overall bandwidth while also achieving comparable efficiency relative to state-of-the-art approaches.
Keywords :
field programmable gate arrays; mathematics computing; matrix multiplication; BEE3; ML605; University of Florida Sparse Matrix Collection; Virtex-5 LX155T; Virtex-6 LX240T; compressed sparse row; compressed variable-length bit vector; hardware-optimized sparse matrix representation; multiple matrix encodings; on-the-fly-decoding; runtime-programmable decoder; universal FPGA matrix-vector multiplication architecture; universal single-bit stream library; Arrays; Decoding; Encoding; Field programmable gate arrays; Libraries; Sparse matrices; Vectors; FPGA; dense matrix; reconfigurable computing; spMV; sparse matrix;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field-Programmable Custom Computing Machines (FCCM), 2012 IEEE 20th Annual International Symposium on
Conference_Location :
Toronto, ON
Print_ISBN :
978-1-4673-1605-7
Type :
conf
DOI :
10.1109/FCCM.2012.12
Filename :
6239784
Link To Document :
بازگشت