• DocumentCode
    987517
  • Title

    A Floating-Point Unit for 4D Vector Inner Product with Reduced Latency

  • Author

    Kim, Donghyun ; Kim, Lee-Sup

  • Author_Institution
    Qualcomm Inc., San Diego, CA
  • Volume
    58
  • Issue
    7
  • fYear
    2009
  • fDate
    7/1/2009 12:00:00 AM
  • Firstpage
    890
  • Lastpage
    901
  • Abstract
    This paper presents the algorithm and implementation of a new high-performance functional unit for floating-point four-dimensional vector inner product (4D dot product; DP4), which is most frequently performed in 3D graphics application. The proposed IEEE-compliant DP4 unit computes Z = AB + CD + EF + GH in one path and keeps the intermediate rounding by IEEE-754 rounding to nearest even. The intermediate rounding is merged with shift alignment, and intermediate carry-propagated addition and normalization are omitted to reduce latency in the proposed architecture. The proposed DP4 unit is implemented with 0.18-mum CMOS technology and has 12.8-ns critical path delay, which is reduced by 45.5 percent compared to a previous DP4 implementation using discrete multipliers and adders. The proposed DP4 unit also reduces the cycle time of 3D graphics applications by 12.4 percent on the average compared to the usual 3D graphics FPU based on four-way multiply-add-fused units.
  • Keywords
    CMOS integrated circuits; computer graphic equipment; floating point arithmetic; 3D graphics application; 4D dot product; 4D vector inner product; CMOS technology; IEEE-compliant DP4 unit; critical path delay; floating-point unit; high-performance functional unit; intermediate carry-propagated addition; size 0.18 mum; Acceleration; CMOS technology; Computer architecture; Delay; Engines; Floating-point arithmetic; Graphical user interfaces; Graphics; Hardware; Mobile computing; Pipelines; 3D graphics.; DP4; Floating point arithmetic; Floating-point arithmetic; Graphics processors; Vector inner product; vector inner product;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2008.210
  • Filename
    4674343