Title : 
Fair share: Allocation of GPU resources for both performance and fairness
         
        
            Author : 
Aguilera, Pedro ; Morrow, Katherine ; Nam Sung Kim
         
        
            Author_Institution : 
Dept. of Electr. & Comput. Eng., Univ. of Wisconsin-Madison, Madison, WI, USA
         
        
        
        
        
        
            Abstract : 
General-purpose computing on the GPU (GPGPU computing) is becoming widely adopted for an increasing variety of applications. However, it has been shown that as the available computing elements in the GPU increase with every generation some GPGPU applications fail to fully utilize the GPU resources. Spatial multitasking-subdividing GPU resources amongst concurrently-running applications-has been shown to increase overall system performance and utilization for GPGPU computing. However, dividing the computing resources among multiple applications to maximize system performance often results in one application having “unfair” access to GPU resources. Yet, evenly dividing resources among applications does not guarantee equal speedups to each application; nor does it take into account overall system performance. In this paper we examine several different ways to characterize “fairness” for GPGPU spatial multitasking, by balancing individual application´s performance and overall system performance. We further present a run-time algorithm to predict and adjust the SM allocation at runtime to meet the desired fairness metric.
         
        
            Keywords : 
graphics processing units; multiprocessing programs; performance evaluation; resource allocation; GPGPU computing; GPGPU spatial multitasking; GPU resource allocation; GPU resource subdividing; SM allocation adjustment; SM allocation prediction; concurrently-running applications; general-purpose computing; overall system performance; run-time algorithm; streaming multiprocessors; Graphics processing units; Instruction sets; Kernel; Multitasking; Resource management; Throughput; Transform coding; GPGPU computing; fairness; resource allocation;
         
        
        
        
            Conference_Titel : 
Computer Design (ICCD), 2014 32nd IEEE International Conference on
         
        
            Conference_Location : 
Seoul
         
        
        
            DOI : 
10.1109/ICCD.2014.6974717