DocumentCode
3089687
Title
An Energy Manager for High Performance Computer Clusters
Author
Alvarruiz, Fernando ; De Alfonso, Carlos ; Caballer, Miguel ; Hern´ndez, V.
Author_Institution
Inst. de Instrumentacion para Imagen Mol. (I3M), Univ. Politec. de Valencia, Valencia, Spain
fYear
2012
fDate
10-13 July 2012
Firstpage
231
Lastpage
238
Abstract
This paper presents a general energy management system for HPC clusters and cloud infrastructures that powers off cluster nodes when they are not being used, and conversely powers them on when they are needed. This system can be integrated with different HPC cluster middleware, such as Batch-Queuing Systems or Cloud Management Systems, by using a set of connectors, and is also able to deal with different mechanisms for powering on and off the computing nodes (such as Wake-on-Lan, Power Device Units, Intelligent Platform Management Interface or other infrastructure-specific mechanisms). While some existing Batch-Queuing Systems provide energy saving mechanisms, other popular choices lack this feature. Cloud management middleware do not generally provide this feature out of the box, and incorporating it implies making modifications to the middleware. The advantage of our approach is that it can be integrated with different resource management middleware, without needing any modification of that middleware. The paper describes the successful integration of the system proposed with the popular Torque/PBS management system, and also with the OpenNebula open source cloud management tool. Two real use-cases are presented, involving two different HPC clusters. These use cases show significant energy/costs savings of 38% and 16%.
Keywords
cloud computing; middleware; queueing theory; workstation clusters; HPC cluster; Torque-PBS management system; Wake-on-Lan; batch-queuing system; cloud infrastructure; cloud management system; energy manager; energy saving mechanism; general energy management system; high performance computer cluster; infrastructure-specific mechanism; intelligent platform management interface; middleware; power device unit; Booting; Computers; Connectors; Middleware; Resource management; Switches; Virtual machining; HPC; cloud computing; energy management; green computing;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing with Applications (ISPA), 2012 IEEE 10th International Symposium on
Conference_Location
Leganes
Print_ISBN
978-1-4673-1631-6
Type
conf
DOI
10.1109/ISPA.2012.38
Filename
6280297
Link To Document