• DocumentCode
    2999092
  • Title

    A Power Provision and Capping Architecture for Large Scale Systems

  • Author

    Liu, Yongpeng ; Zhu, Hong ; Lu, Kai ; Liu, Yongyan

  • Author_Institution
    Sch. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    954
  • Lastpage
    963
  • Abstract
    The rapid growth of large scale computing systems imposes a grave challenge to their power management, where power provision and capping is essential. In this paper, we propose a new architecture of power provision and capping to control the power consumption of large scale clusters. In this architecture, performance sensitive computation units are distinguished from those having less impact on system performance. A subset of units is monitored and their operation states are controlled in order to maintain whole system´s total power consumption under budget. Two policies are designed and implemented to select the target subset of nodes for power regulation. One policy is state-based, which chooses nodes running the most power consuming job for power regulation. The other is change-based, which chooses those nodes that runs a job whose power consumption increases most rapidly among all jobs. Experiments have been conducted on the Tianhe-1A supercomputer system to evaluate the effectiveness of these power capping solutions. The experiments demonstrated that the new architecture can ensure power usage safety with only a negligible decline of performance, which is only about 2%.
  • Keywords
    power aware computing; power consumption; Tianhe-1A supercomputer system; capping architecture; large scale cluster; large scale computing system; performance sensitive computation unit; power capping; power consumption control; power management; power provision; power regulation; power usage safety; Computer architecture; Computers; Large-scale systems; Measurement; Monitoring; Power demand; Safety; Large-scale system; Metrics; Power capping; Power control architecture;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0974-5
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2012.117
  • Filename
    6270742