• DocumentCode
    3077973
  • Title

    A Scheduler-Level Incentive Mechanism for Energy Efficiency in HPC

  • Author

    Georgiou, Yiannis ; Glesser, David ; Rzadca, Krzysztof ; Trystram, Denis

  • Author_Institution
    BULL, France
  • fYear
    2015
  • fDate
    4-7 May 2015
  • Firstpage
    617
  • Lastpage
    626
  • Abstract
    Energy consumption has become one of the most important factors in High Performance Computing platforms. However, while there are various algorithmic and programming techniques to save energy, a user has currently no incentive to employ them, as they might result in worse performance. We propose to manage the energy budget of a supercomputer through EnergyFairShare (EFS), a FairShare-like scheduling algorithm. FairShare is a classic scheduling rule that prioritizes jobs belonging to users who were assigned small amount of CPU-second in the past. Similarly, EFS keeps track of users ´consumption of Watt-seconds and prioritizes those whom jobs consumed less energy. Therefore, EFS incentives users to optimize their code for energy efficiency. Having higher priority, jobs have smaller queuing times and, thus, smaller turn-around time. To validate this principle, we implemented EFS in a scheduling simulator and processed workloads from various HPC centers. The results show that, by reducing it energy consumption, auser will reduce it stretch (slowdown), compared to increasing it energy consumption. To validate the general feasibility odour approach, we also implemented EFS as an extension forSLURM, a popular HPC resource and job management system.We validated our plugin both by emulating a large scale platform, and by experiments upon a real cluster with monitored energy consumption. We observed smaller waiting times for energy efficient users.
  • Keywords
    mainframes; parallel machines; parallel processing; power aware computing; queueing theory; scheduling; EnergyFairShare; FairShare-like scheduling algorithm; HPC centers; HPC job management system; HPC resource management system; SLURM; algorithmic techniques; energy consumption; energy efficiency; energy efficient users; high performance computing platforms; incentives; programming techniques; queuing times; scheduler-level incentive mechanism; scheduling rule; scheduling simulator; supercomputer; Energy consumption; Hardware; Monitoring; Power measurement; Program processors; Standards; Supercomputers; Energy-Aware; Energy-Efficiency; FairShare; Resource and Job Management Systems; Scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
  • Conference_Location
    Shenzhen
  • Type

    conf

  • DOI
    10.1109/CCGrid.2015.101
  • Filename
    7152527