DocumentCode
3075068
Title
Analyzing throughput of GPGPUs exploiting within-die core-to-core frequency variation
Author
Lee, Jungseob ; Ajgaonkar, Paritosh Pratap ; Kim, Nam Sung
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Wisconsin-Madison, Madison, WI, USA
fYear
2011
fDate
10-12 April 2011
Firstpage
237
Lastpage
246
Abstract
The state-of-the-art general-purpose graphic processing units (GPGPUs) can offer very high computational throughput for general-purpose, highly-parallel applications using hundreds of available on-chip cores. Meanwhile, as technology is scaled down below 65nm, each core´s maximum frequency varies significantly due to increasing within-die variations. This, in turn, diminishes the throughput improvement of GPGPUs through technology scaling because the maximum frequency is often limited by the slowest core. In this paper, we investigate two techniques that can mitigate the impact of frequency variations on GPGPU´s throughput: 1) running each core at its maximum frequency independently and 2) disabling the slowest cores. Both can maximize GPGPU´s frequency at either the individual core or entire processor level. Our experimental results using a GPGPU simulator and a 32nm technology show that the first and second techniques can improve the throughput of compute- and problem-size-bounded applications by up to 32% and 19%, respectively.
Keywords
computer graphic equipment; coprocessors; variational techniques; GPGPU simulator; core-to-core frequency variation; general-purpose graphic processing units; technology scaling; within-die variations; Bandwidth; Clocks; Graphics processing unit; Instruction sets; Multicore processing; System-on-a-chip; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
Performance Analysis of Systems and Software (ISPASS), 2011 IEEE International Symposium on
Conference_Location
Austin, TX
Print_ISBN
978-1-61284-367-4
Electronic_ISBN
978-1-61284-368-1
Type
conf
DOI
10.1109/ISPASS.2011.5762740
Filename
5762740
Link To Document