Title :
Floating-point fused multiply-add with reduced latency
Author :
Lang, Tomas ; Bruguera, Javier D.
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Irvine, CA, USA
Abstract :
We propose an architecture for the computation of the floating-point multiply-add-fused (MAF) operation A+ (B × C). This architecture is based on the combined addition and rounding (using a dual adder) and on the anticipation of the normalization step before the addition. Because the normalization is performed before the addition, it is not possible to overlap the leading-zero-anticipator with the adder. Consequently, to avoid the increase in delay we modify the design of the LZA so that the leading bits of its output are produced first and can be used to begin the normalization. Moreover, parts of the addition are also anticipated. We have estimated the delay of the resulting architecture for double-precision format, considering the load introduced by long connections, and estimate a reduction of about 15% to 20% with respect to traditional implementations of the floating-point MAF unit.
Keywords :
floating point arithmetic; logic design; delay; double-precision format; floating-point; floating-point MAF; floating-point multiply-add-fused operation; normalization; Adders; Computer architecture; Contracts; Delay effects; Delay estimation; Hardware; Very large scale integration;
Conference_Titel :
Computer Design: VLSI in Computers and Processors, 2002. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1700-5
DOI :
10.1109/ICCD.2002.1106762