Author :
Georgiou, Yiannis ; Glesser, David ; Rzadca, Krzysztof ; Trystram, Denis
Abstract :
Energy consumption has become one of the most important factors in High Performance Computing platforms. However, while there are various algorithmic and programming techniques to save energy, a user has currently no incentive to employ them, as they might result in worse performance. We propose to manage the energy budget of a supercomputer through EnergyFairShare (EFS), a FairShare-like scheduling algorithm. FairShare is a classic scheduling rule that prioritizes jobs belonging to users who were assigned small amount of CPU-second in the past. Similarly, EFS keeps track of users ´consumption of Watt-seconds and prioritizes those whom jobs consumed less energy. Therefore, EFS incentives users to optimize their code for energy efficiency. Having higher priority, jobs have smaller queuing times and, thus, smaller turn-around time. To validate this principle, we implemented EFS in a scheduling simulator and processed workloads from various HPC centers. The results show that, by reducing it energy consumption, auser will reduce it stretch (slowdown), compared to increasing it energy consumption. To validate the general feasibility odour approach, we also implemented EFS as an extension forSLURM, a popular HPC resource and job management system.We validated our plugin both by emulating a large scale platform, and by experiments upon a real cluster with monitored energy consumption. We observed smaller waiting times for energy efficient users.
Keywords :
mainframes; parallel machines; parallel processing; power aware computing; queueing theory; scheduling; EnergyFairShare; FairShare-like scheduling algorithm; HPC centers; HPC job management system; HPC resource management system; SLURM; algorithmic techniques; energy consumption; energy efficiency; energy efficient users; high performance computing platforms; incentives; programming techniques; queuing times; scheduler-level incentive mechanism; scheduling rule; scheduling simulator; supercomputer; Energy consumption; Hardware; Monitoring; Power measurement; Program processors; Standards; Supercomputers; Energy-Aware; Energy-Efficiency; FairShare; Resource and Job Management Systems; Scheduling;