DocumentCode :
85376
Title :
A 2.05 GVertices/s 151 mW Lighting Accelerator for 3D Graphics Vertex and Pixel Shading in 32 nm CMOS
Author :
Sheikh, Farhana ; Mathew, Sanu K. ; Anders, Mark A. ; Kaul, Himanshu ; Hsu, S.K. ; Agarwal, Abhishek ; Krishnamurthy, Ram K. ; Borkar, Shekhar
Author_Institution :
Circuit Res. Lab., Intel Corp., Hillsboro, OR, USA
Volume :
48
Issue :
1
fYear :
2013
fDate :
Jan. 2013
Firstpage :
128
Lastpage :
139
Abstract :
This paper describes a single-cycle throughput lighting accelerator fabricated in 1.05 V, 32 nm CMOS for on-die acceleration of 3D graphics vertex and pixel shading in high-performance processors and mobile systems-on-chip. Log-domain parallel computation of ambient, diffuse, and specular lighting using high-accuracy 32b log and anti-log units that convert computation from floating-point (FP) to fixed-point domain, 32b sparse-tree fixed-point adders and a 32 × 32b signed fixed-point multiplier with truncated partial product reduction tree enable 2.05 GVertices/s throughput measured at 1.05 V, 25°C in an area of 0.064 mm2 while achieving: (i) 47% reduction in critical path logic stages compared to previously published work, (ii) 0.56% mean vertex lighting error compared to single-precision FP computation, (iii) 354 μW active leakage power measured at 1.05 V, 25 °C, (iv) scalable performance up to 2.22 GHz, 232 mW measured at 1.2 V, (v) peak energy-efficiency of 56 GVertices/s/W, measured at 560 mV, 25 °C, and (vi) 119.6 dB PSNR for a 2 M pixel high-resolution 3D image.
Keywords :
CMOS integrated circuits; adders; image resolution; parallel processing; system-on-chip; trees (mathematics); 3D graphics vertex; CMOS; PSNR; active leakage power; antilog unit; critical path logic; fixed-point domain; floating-point; frequency 2.22 GHz; high-accuracy log unit; high-performance processor; log-domain parallel computation; mean vertex lighting error; mobile systems-on-chip; noise figure 119.6 dB; on-die acceleration; peak energy-efficiency; pixel high-resolution 3D image; pixel shading; power 151 mW; power 232 mW; power 354 muW; signed fixed-point multiplier; single-cycle throughput lighting accelerator; single-precision FP computation; size 32 nm; sparse-tree fixed-point adder; specular lighting; temperature 25 C; truncated partial product reduction tree; voltage 1.05 V; voltage 1.2 V; voltage 560 V; Acceleration; Accuracy; Approximation methods; Equations; Graphics; Lighting; Mathematical model; 3D graphics; Phong illumination; fixed-point; lighting acceleration; log arithmetic; vertex and pixel shading;
fLanguage :
English
Journal_Title :
Solid-State Circuits, IEEE Journal of
Publisher :
ieee
ISSN :
0018-9200
Type :
jour
DOI :
10.1109/JSSC.2012.2222813
Filename :
6374707
Link To Document :
بازگشت