DocumentCode
1514431
Title
Accelerating Matrix Operations with Improved Deeply Pipelined Vector Reduction
Author
Tai, Yi-Gang ; Lo, Chia-Tien Dan ; Psarris, Kleanthis
Author_Institution
Dept. of Comput. Sci., Univ. of Texas at San Antonio, San Antonio, TX, USA
Volume
23
Issue
2
fYear
2012
Firstpage
202
Lastpage
210
Abstract
Many scientific or engineering applications involve matrix operations, in which reduction of vectors is a common operation. If the core operator of the reduction is deeply pipelined, which is usually the case, dependencies between the input data elements cause data hazards. To tackle this problem, we propose a new reduction method with low latency and high pipeline utilization. The performance of the proposed design is evaluated for both single data set and multiple data set scenarios. Further, QR decomposition is used to demonstrate how the proposed method can accelerate its execution. We implement the design on an FPGA and compare its results to other methods.
Keywords
digital arithmetic; field programmable gate arrays; matrix algebra; FPGA; QR decomposition; improved deeply pipelined vector reduction; matrix operations; Adders; Clocks; Hardware; Merging; Pipeline processing; Pipelines; Registers; Reconfigurable hardware; algorithm design and analysis.; parallel algorithms; parallel and vector implementations; pipeline processors;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2011.141
Filename
5765947
Link To Document