Title :
Matrix multiplication by an inexact systolic array
Author :
Ke Chen ; Lombardi, Fabrizio ; Jie Han
Author_Institution :
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA, USA
Abstract :
Different schemes for approximate computing of matrix multiplication (MM) in systolic arrays are presented in this manuscript. Inexact full adder cells are utilized in a processing element (PE) for the Baugh-Wooley multiplier and/or the final adder as circuits implementing the two computational steps required for MM. An extensive analysis and simulation-based assessment of three inexact schemes for the PE are pursued with respect to circuit level performance (such as delay, power consumption and number of transistors) and figures of merit of approximate computing (such as the error distance). The execution of MM in each PE results in an inexact computation affecting only the outputs of the same columns, so the extension of inexact computation to a systolic array can also be performed with very limited error. The discrete cosine transform as application of the proposed inexact systolic arrays, is evaluated; simulation results show that the proposed inexact array is very effective, incurring in a small error.
Keywords :
adders; approximation theory; matrix multiplication; systolic arrays; Baugh-Wooley multiplier; approximate computing; circuit level performance; discrete cosine transform; error distance; figures of merit; inexact full adder cells; matrix multiplication; processing element; systolic arrays; Adders; Arrays; Complexity theory; Delays; Discrete cosine transforms; Power dissipation; Transistors; Approximate computing; inexact design; matrix multiplication; systolic array;
Conference_Titel :
Nanoscale Architectures (NANOARCH), 2015 IEEE/ACM International Symposium on
Conference_Location :
Boston, MA
DOI :
10.1109/NANOARCH.2015.7180604