Title :
Adaptive Configuration Selection for Power-Constrained Heterogeneous Systems
Author :
Bailey, Peter E. ; Lowenthal, David K. ; Ravi, Vignesh ; Rountree, Barry ; Schulz, Markus ; de Supinski, Bronis R.
Abstract :
As power becomes an increasingly important design factor in high-end supercomputers, future systems will likely operate with power limitations significantly below their peak power specifications. These limitations will be enforced through a combination of software and hardware power policies, which will filter down from the system level to individual nodes. Hardware is already moving in this direction by providing power-capping interfaces to the user. The power/performance trade-off at the node level is critical in maximizing the performance of power-constrained cluster systems, but is also complex because of the many interacting architectural features and accelerators that comprise the hardware configuration of a node. The key to solving this challenge is an accurate power/performance model that will aid in selecting the right configuration from a large set of available configurations. In this paper, we present a novel approach to generate such a model offline using kernel clustering and multivariate linear regression. Our model requires only two iterations to select a configuration, which provides a significant advantage over exhaustive search-based strategies. We apply our model to predict power and performance for different applications using arbitrary configurations, and show that our model, when used with hardware frequency-limiting, selects configurations with significantly higher performance at a given power limit than those chosen by frequency-limiting alone. When applied to a set of 36 computational kernels from a range of applications, our model accurately predicts power and performance, it maintains 91% of optimal performance while meeting power constraints 88% of the time. When the model violates a power constraint, it exceeds the constraint by only 6% in the average case, while simultaneously achieving 54% more performance than an oracle.
Keywords :
parallel machines; pattern clustering; regression analysis; software standards; adaptive configuration selection; design factor; hardware configuration; hardware frequency-limiting; hardware power policy; high-end supercomputer; kernel clustering; multivariate linear regression; node level; peak power specification; power limitation; power-capping interfaces; power-constrained cluster system; power-constrained heterogeneous systems; software power policy; Computational modeling; Graphics processing units; Kernel; Performance evaluation; Predictive models; Training; GPU APU power performance modeling power-constrained;
Conference_Titel :
Parallel Processing (ICPP), 2014 43rd International Conference on
Conference_Location :
Minneapolis MN
DOI :
10.1109/ICPP.2014.46