DocumentCode :
267716
Title :
High speed special function unit for graphics processing unit
Author :
Qoutb, Abd-Elrahman G. ; El-Gunidy, Abdullah M. ; Tolba, Mohammed F. ; El-Moursy, Magdy A.
Author_Institution :
Electr. Eng. Dept., Fayoum Univ., Fayoum, Egypt
fYear :
2014
fDate :
16-18 Dec. 2014
Firstpage :
24
Lastpage :
29
Abstract :
A fixed-point ASIC design for high-speed, second-order, piecewise function approximation is presented. A Non-Uniform segmentation method based on Minimax approximation is used to get the interpolation coefficients. Non-Uniform segmentation, effectively, reduces the size of the coefficient table with a small area overhead for the address encoder. The proposed algorithm truncates the binary coefficients within the pre-al located error. Radix-eight Booth multipliers are used to reduce the number of partial products to, around one third of the traditional multiplication, hence speeding up the evaluation process. Very fast reduction trees with four-to-two compressors are used to reduce the number of the resulting partial products. Also, a new radix-eight sign template which reduces the overall area of the multipliers is proposed. Hybrid carry-look ahead, carry-ripple adders are, also, used. The design has been verified on FPGA Moreover, 45nm PDK is used to synthesize and layout the design. A maximum propagation delay of 5.251ns is achieved with a reduction of 19% in the total delay as compared to other traditional methods. A total chip area of 0.014mm2 is also achieved.
Keywords :
adders; field programmable gate arrays; function approximation; graphics processing units; interpolation; logic design; minimax techniques; FPGA; address encoder; application-specific integrated circuits; area overhead; binary coefficients; carry-look ahead; carry-ripple adders; coefficient table; field programmable gate array; fixed-point ASIC design; four-to-two compressors; graphics processing unit; high speed special function unit; interpolation coefficients; minimax approximation; nonuniform segmentation method; piecewise function approximation; radix-eight Booth multipliers; reduction trees; Adders; Approximation error; Delays; Graphics processing units; Polynomials; Table lookup; Booth Multiplier; GPU; Hybrid Multiplier; Minimax; Nmeric Function Generator (NFG); Non-Uniform Segmentation; Special Function Unit (SFU); Vertix Shader Processor;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Design & Test Symposium (IDT), 2014 9th International
Conference_Location :
Algiers
Type :
conf
DOI :
10.1109/IDT.2014.7038581
Filename :
7038581
Link To Document :
بازگشت