Title :
High performance floating-point unit with 116 bit wide divider
Author :
Gerwig, Guenter ; Wetter, Holger ; Schwarz, Eric M. ; Haess, Juergen
Author_Institution :
IBM Server Div., USA
Abstract :
The next generation zSeries floating-point unit is unveiled which is the first IBM mainframe with a fused multiply-add dataflow. It supports both S/390 hexadecimal floating-point architecture and the IEEE 754 binary floating-point architecture which was first implemented in S/390 on the 1998 S/390 G5 floating-point unit. The new floating-point unit supports a total of 6 formats including single, double, and quadword formats implemented in hardware. The floating-point pipeline is 5 cycles with a throughput of 1 multiply-add per cycle. Both hexadecimal and binary floating-point instructions are capable of this performance due to a novel way of handling both formats. Other key developments include new methods for handling denormalized numbers and quad precision divide engine dataflow. This divider uses a radix-4 SRT algorithm and is able to handle quad precision divides in multiple floating-point and fixed-point formats. The number of iterations for fixed-point divisions depend on the effective number of quotient bits. It uses a reduced carry-save form for the partial remainder, with only 1 carry bit for every 4 sum bits, to save area and power.
Keywords :
carry logic; data flow computing; fixed point arithmetic; floating point arithmetic; mainframes; parallel architectures; IEEE 754 binary floating-point architecture; S/390 hexadecimal floating-point architecture; carry-save form; denormalized numbers; fixed-point format; floating-point pipeline; multiply-add dataflow; quad precision divide engine; radix-4 SRT algorithm; zSeries floating-point unit; Digital arithmetic;
Conference_Titel :
Computer Arithmetic, 2003. Proceedings. 16th IEEE Symposium on
Print_ISBN :
0-7695-1894-X
DOI :
10.1109/ARITH.2003.1207664