DocumentCode :
1338958
Title :
An Area and Energy Efficient Inner-Product Processor for Serial-Link Bus Architecture
Author :
Meher, Manas Ranjan ; Jong, Ching Chuen ; Chang, Chip-Hong
Author_Institution :
Integrated Syst. Res. Lab., Nanyang Technol. Univ., Singapore, Singapore
Volume :
59
Issue :
12
fYear :
2012
Firstpage :
2945
Lastpage :
2955
Abstract :
A unique word-serial inner-product processor architecture is proposed to capitalize on the high-speed serial-link bus. To eliminate the input buffers and deserializers, partial products are generated immediately from the serial input data and accumulated by an array of small binary counters operating in parallel to form a reduced partial product matrix directly. The height of the resultant partial product matrix is reduced logarithmically, and hence the carry-save-adder tree needed to complete the inner-product computation is smaller and faster. The small binary counters act as active on-chip buffers to mitigate the workload of the partial product accumulator. Their ability to accumulate partial product bits faster than combinatorial full adder leads to a simple two-stage architecture of high throughput and low latency. The architecture consumes 46% less silicon area, 24% less energy per inner-product computation and 70% less total interconnect length than its merged arithmetic counterpart in 65 nm CMOS process. In addition, the architecture requires only 4 metal layers out of available 7 layers for signal and power routing. By emulating the on-chip serial-link bus architecture on both designs, it is demonstrated that the proposed design is most suited for high-speed on-chip serial-link bus architecture.
Keywords :
CMOS logic circuits; adders; combinational circuits; matrix algebra; system buses; trees (mathematics); CMOS process; active on-chip buffers; area inner-product processor; carry-save-adder tree; combinatorial full adder; deserializers; energy efficient inner-product processor; high-speed serial-link bus architecture; inner-product computation; input buffers; partial product accumulator; power routing; reduced partial product matrix; signal routing; size 65 nm; small binary counters; unique word-serial inner-product processor architecture; Adders; Computer architecture; Logic gates; Radiation detectors; System-on-a-chip; Vectors; ASIC implementation; binary multiplication; inner-product; merged arithmetic;
fLanguage :
English
Journal_Title :
Circuits and Systems I: Regular Papers, IEEE Transactions on
Publisher :
ieee
ISSN :
1549-8328
Type :
jour
DOI :
10.1109/TCSI.2012.2220471
Filename :
6359947
Link To Document :
بازگشت