• DocumentCode
    174682
  • Title

    Fair share: Allocation of GPU resources for both performance and fairness

  • Author

    Aguilera, Pedro ; Morrow, Katherine ; Nam Sung Kim

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Wisconsin-Madison, Madison, WI, USA
  • fYear
    2014
  • fDate
    19-22 Oct. 2014
  • Firstpage
    440
  • Lastpage
    447
  • Abstract
    General-purpose computing on the GPU (GPGPU computing) is becoming widely adopted for an increasing variety of applications. However, it has been shown that as the available computing elements in the GPU increase with every generation some GPGPU applications fail to fully utilize the GPU resources. Spatial multitasking-subdividing GPU resources amongst concurrently-running applications-has been shown to increase overall system performance and utilization for GPGPU computing. However, dividing the computing resources among multiple applications to maximize system performance often results in one application having “unfair” access to GPU resources. Yet, evenly dividing resources among applications does not guarantee equal speedups to each application; nor does it take into account overall system performance. In this paper we examine several different ways to characterize “fairness” for GPGPU spatial multitasking, by balancing individual application´s performance and overall system performance. We further present a run-time algorithm to predict and adjust the SM allocation at runtime to meet the desired fairness metric.
  • Keywords
    graphics processing units; multiprocessing programs; performance evaluation; resource allocation; GPGPU computing; GPGPU spatial multitasking; GPU resource allocation; GPU resource subdividing; SM allocation adjustment; SM allocation prediction; concurrently-running applications; general-purpose computing; overall system performance; run-time algorithm; streaming multiprocessors; Graphics processing units; Instruction sets; Kernel; Multitasking; Resource management; Throughput; Transform coding; GPGPU computing; fairness; resource allocation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Design (ICCD), 2014 32nd IEEE International Conference on
  • Conference_Location
    Seoul
  • Type

    conf

  • DOI
    10.1109/ICCD.2014.6974717
  • Filename
    6974717