Title : 
Series approximation methods for divide and square root in the Power3TM processor
         
        
            Author : 
Agarwal, Ramesh C. ; Gustavson, Fred G. ; Schmookler, Martin S.
         
        
            Author_Institution : 
Res. Div., IBM Corp., Yorktown, NY, USA
         
        
        
        
        
        
            Abstract : 
The Power3 processor is a 64-bit implementation of the PowerPCTM architecture and is the successor to the Power2TM processor for workstations and servers which require high performance floating point capability. The previous processors used Newton-Raphson algorithms for their implementations of divide and square root. The Power3 processor has a longer pipeline latency, which would substantially increase the latency for these instructions. Instead, new algorithms based on power series approximations were developed which provide significantly better performance than the Newton-Raphson algorithm for this processor. This paper describes the algorithms, and then shows how both the series based algorithms and the Newton-Raphson algorithms are affected by pipeline length. For the Power3, the power series algorithms reduce the divide latency by over 20% and the square root latency by 35%
         
        
            Keywords : 
Newton-Raphson method; floating point arithmetic; pipeline arithmetic; Newton-Raphson algorithms; Power3 processor; Power3TM processor; divide; high performance floating point; pipeline; power series approximations; square root; Algorithm design and analysis; Approximation methods; Delay; Frequency; Hardware; Microprocessor chips; Pipelines; Reduced instruction set computing; Silicon; Table lookup;
         
        
        
        
            Conference_Titel : 
Computer Arithmetic, 1999. Proceedings. 14th IEEE Symposium on
         
        
            Conference_Location : 
Adelaide, SA
         
        
        
            Print_ISBN : 
0-7695-0116-8
         
        
        
            DOI : 
10.1109/ARITH.1999.762836