• DocumentCode
    3089687
  • Title

    An Energy Manager for High Performance Computer Clusters

  • Author

    Alvarruiz, Fernando ; De Alfonso, Carlos ; Caballer, Miguel ; Hernández, V.

  • Author_Institution
    Inst. de Instrumentacion para Imagen Mol. (I3M), Univ. Politec. de Valencia, Valencia, Spain
  • fYear
    2012
  • fDate
    10-13 July 2012
  • Firstpage
    231
  • Lastpage
    238
  • Abstract
    This paper presents a general energy management system for HPC clusters and cloud infrastructures that powers off cluster nodes when they are not being used, and conversely powers them on when they are needed. This system can be integrated with different HPC cluster middleware, such as Batch-Queuing Systems or Cloud Management Systems, by using a set of connectors, and is also able to deal with different mechanisms for powering on and off the computing nodes (such as Wake-on-Lan, Power Device Units, Intelligent Platform Management Interface or other infrastructure-specific mechanisms). While some existing Batch-Queuing Systems provide energy saving mechanisms, other popular choices lack this feature. Cloud management middleware does not generally provide this feature out of the box, and incorporating it implies making modifications to the middleware. The advantage of our approach is that it can be integrated with different resource management middleware, without needing any modification of that middleware. The paper describes the successful integration of the proposed system with the popular Torque/PBS management system, and also with the OpenNebula open source cloud management tool. Two real use cases are presented, involving two different HPC clusters. These use cases show significant energy and cost savings of 38% and 16%.
  • Keywords
    cloud computing; middleware; queueing theory; workstation clusters; HPC cluster; Torque-PBS management system; Wake-on-Lan; batch-queuing system; cloud infrastructure; cloud management system; energy manager; energy saving mechanism; general energy management system; high performance computer cluster; infrastructure-specific mechanism; intelligent platform management interface; middleware; power device unit; Booting; Computers; Connectors; Middleware; Resource management; Switches; Virtual machining; HPC; cloud computing; energy management; green computing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Processing with Applications (ISPA), 2012 IEEE 10th International Symposium on
  • Conference_Location
    Leganes
  • Print_ISBN
    978-1-4673-1631-6
  • Type

    conf

  • DOI
    10.1109/ISPA.2012.38
  • Filename
    6280297