Title : 
Thermal, power, and co-location aware resource allocation in heterogeneous high performance computing systems
         
        
            Author : 
Oxley, Mark A. ; Jonardi, Eric ; Pasricha, Sudeep ; Maciejewski, Anthony A. ; Koenig, Gregory A. ; Siegel, Howard Jay
         
        
            Author_Institution : 
Dept. of Electr. & Comput. Eng., Colorado State Univ., Fort Collins, CO, USA
         
        
        
        
        
        
            Abstract : 
The rapid increase in power consumption of high performance computing (HPC) systems has led to an increase in the amount of cooling resources required to operate these facilities at a reliable threshold. The cooling systems contribute a large portion of the total power consumption of the facility, thus driving up the costs of providing power to these facilities. In addition, when cores sharing resources (e.g., last-level cache) execute applications at the same time, they can experience contention and therefore performance degradation. By taking a holistic approach to HPC facility management through intelligently allocating both computing and cooling resources, the performance of the HPC system can be maximized by considering co-location while obeying power consumption and thermal constraints. The performance of the system is quantified as the total reward earned from completing tasks by their individual deadlines. We propose three novel resource allocation techniques to maximize performance under power and thermal constraints when considering co-location effects: (1) a greedy heuristic, (2) a genetic algorithm technique used in combination with a new local search technique that guarantees the power and thermal constraints, and (3) a nonlinear programming based approach (from previous work), adapted to consider co-location effects.
         
        
            Keywords : 
cooling; genetic algorithms; greedy algorithms; nonlinear programming; parallel processing; performance evaluation; power aware computing; resource allocation; search problems; HPC facility management; colocation aware resource allocation; computing resource allocation; cooling resource allocation; cooling resources; cooling systems; genetic algorithm technique; greedy heuristic; heterogeneous high performance computing systems; last-level cache; local search technique; nonlinear programming based approach; performance degradation; power aware resource allocation; power consumption; resource sharing; thermal aware resource allocation; thermal constraints; Cooling; Degradation; Interference; Power demand; Program processors; Resource management; Servers; DVFS; data center; heterogeneous computing; memory interference; power-aware computing; resource management; thermal-aware computing;
         
        
        
        
            Conference_Titel : 
Green Computing Conference (IGCC), 2014 International
         
        
            Conference_Location : 
Dallas, TX
         
        
        
            DOI : 
10.1109/IGCC.2014.7039148