DocumentCode :
1063746
Title :
High-speed double-precision computation of reciprocal, division, square root, and inverse square root
Author :
Pineiro, Jose-Alejandro ; Bruguera, Javier Diaz
Author_Institution :
Dept. de Electron. y Comput., Santiago de Compostela Univ., Spain
Volume :
51
Issue :
12
fYear :
2002
fDate :
12/1/2002 12:00:00 AM
Firstpage :
1377
Lastpage :
1388
Abstract :
A new method for the high-speed computation of double-precision floating-point reciprocal, division, square root, and inverse square root operations is presented in this paper. This method employs a second-degree minimax polynomial approximation to obtain an accurate initial estimate of the reciprocal and the inverse square root values, and then performs a modified Goldschmidt iteration. The high accuracy of the initial approximation allows us to obtain double-precision results by computing a single Goldschmidt iteration, significantly reducing the latency of the algorithm. Two unfolded architectures are proposed: the first one computing only reciprocal and division operations, and the second one also including the computation of square root and inverse square root. The execution times and area costs for both architectures are estimated, and a comparison with other multiplicative-based methods is presented. The results of this comparison show the achievement of a lower latency than these methods, with similar hardware requirements.
Keywords :
floating point arithmetic; iterative methods; minimax techniques; polynomial approximation; accurate initial estimate; area costs; double-precision floating-point division operations; double-precision floating-point inverse square root operations; double-precision floating-point reciprocal operations; double-precision floating-point square root operations; execution times; hardware requirements; high-speed double-precision computation; latency; modified Goldschmidt iteration; multiplicative-based methods; second-degree minimax polynomial approximation; unfolded architectures; Approximation algorithms; Computer architecture; Convergence; Delay; Hardware; High performance computing; Iterative algorithms; Minimax techniques; Polynomials; Signal processing algorithms;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/TC.2002.1146704
Filename :
1146704
Link To Document :
بازگشت