DocumentCode :
3321396
Title :
A 90mW/GFlop 3.4GHz Reconfigurable Fused/Continuous Multiply-Accumulator for Floating-Point and Integer Operands in 65nm
Author :
Jain, Shailendra ; Erraguntla, Vasantha ; Vangal, Sriram R. ; Hoskote, Yatin ; Borkar, Nitin ; Mandepudi, Tulasi ; Karthik, V.P.
Author_Institution :
Intel Corp., CA, USA
fYear :
2010
fDate :
3-7 Jan. 2010
Firstpage :
252
Lastpage :
257
Abstract :
This paper describes energy efficient and reconfigurable fused/continuous Multiply-Accumulator (MAC) architecture for single-precision Floating-point and 16-bit signed integer operands. This eight-stage pipelined and single-cycle throughput MAC design contains a bit level pipelined multiplier, followed by fast sparse-tree adder and single cycle accumulator loop with delayed normalization logic. Operation driven energy control is achieved using dynamic clock and fine grained power gating techniques. Power gating is employed in 98% of design to save 79% of leakage power in idle mode, at 1.2 V supply and 110 C. The use of fully shared logic in the multiplier, accumulator and normalization blocks for different operations enables a compact design of 0.54 mm2 containing 117 K transistors in eight-metal 65 nm CMOS technology. The 15-FO4 design provides 6.8 GFLOPS of performance with total energy efficiency of 90 mW/GFLOP at 1.2V and 3.4 GHz operation.
Keywords :
CMOS logic circuits; floating point arithmetic; logic design; microprocessor chips; CMOS technology; bit level pipelined multiplier; delayed normalization logic; dynamic clock; eight-stage pipelined MAC design; fast sparse-tree adder; fine grained power gating techniques; floating-point operands; frequency 3.4 GHz; fully shared logic; operation driven energy control; power 90 mW; reconfigurable fused/continuous multiply-accumulator architecture; signed integer operands; single cycle accumulator loop; single-cycle throughput MAC design; size 65 nm; temperature 110 C; voltage 1.2 V; word length 16 bit; Adders; CMOS technology; Computer architecture; Delay; Energy efficiency; Logic design; Power control; Reconfigurable logic; Signal generators; Throughput; Floating-point; Fused and continuous MAC; VLSI; multiply-accumulate;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
VLSI Design, 2010. VLSID '10. 23rd International Conference on
Conference_Location :
Bangalore
ISSN :
1063-9667
Print_ISBN :
978-1-4244-5541-6
Type :
conf
DOI :
10.1109/VLSI.Design.2010.59
Filename :
5401328
Link To Document :
بازگشت