• DocumentCode
    1013735
  • Title

    Accelerating correctly rounded floating-point division when the divisor is known in advance

  • Author

    Brisebarre, Nicolas ; Muller, Jean-Michel ; Raina, Saurabh Kumar

  • Author_Institution
    Lab. LIP-IMAG, Ecole Normale Superieure de Lyon, France
  • Volume
    53
  • Issue
    8
  • fYear
    2004
  • Firstpage
    1069
  • Lastpage
    1072
  • Abstract
    We present techniques for accelerating the floating-point computation of x/y when y is known before x. The proposed algorithms are oriented toward architectures with available fused-mac operations. The goal is to get exactly the same result as with usual division with rounding to nearest. It is known that the advanced computation of 1/y allows performing correctly rounded division in one multiplication plus two fused-macs. We show algorithms that reduce this latency to one multiplication and one fused-mac. This is achieved if a precision of at least n+1 bits is available, where n is the number of mantissa bits in the target format, or if y satisfies some properties that can be easily checked at compile-time. This requires a double-word approximation of 1/y (we also show how to get it). Compilers to accelerate some numerical programs without loss of accuracy can use these techniques.
  • Keywords
    floating point arithmetic; optimising compilers; roundoff errors; computer arithmetic; double-word approximation; fused-mac operation; optimizing compiler; rounded floating-point division; Acceleration; Application software; Computer architecture; Costs; Delay; Floating-point arithmetic; Iterative algorithms; Program processors; Runtime; Testing; 65; Computer arithmetic; compilation optimization.; division by software; division with fused-mac; floating-point arithmetic;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2004.37
  • Filename
    1306999