Title :
Pruning Strategies in Adaptive Off-Line Tuning for Optimized Composition of Components on Heterogeneous Systems
Author :
Lu Li ; Dastgeer, Usman ; Kessler, Christoph
Author_Institution :
IDA, Linkoping Univ., Linkoping, Sweden
Abstract :
Adaptive program optimizations, such as automatic selection of the expected fastest implementation variant for a computation component depending on runtime context, are important especially for heterogeneous computing systems but require good performance models. Empirical performance models based on trial executions which require no or little human efforts show more practical feasibility if the sampling and training cost can be reduced to a reasonable level. In previous work we proposed an early version of adaptive pruning algorithm for efficient selection of training samples, a decision-tree based method for representing, predicting and selecting the fastest implementation variants for given run-time call context properties, and a composition tool for building the overall composed application from its components. For adaptive pruning we use a heuristic convexity assumption. In this paper we consolidate and improve the method by new pruning techniques to better support the convexity assumption and better control the trade-off between sampling time, prediction accuracy and runtime prediction overhead. Our results show that the training time can be reduced by up to 39 times without noticeable prediction accuracy decrease. Furthermore, we evaluate the effect of combinations of pruning strategies and compare our adaptive sampling method with random sampling. We also use our smart-sampling method as a preprocessor to a state-of-the-art decision tree learning algorithm and compare the result to the predictor directly calculated by our method.
Keywords :
decision trees; parallel processing; random processes; sampling methods; adaptive off-line tuning; adaptive program optimization; decision-tree; heterogeneous computing system; pruning strategy; random sampling; sampling time; smart-sampling method; Accuracy; Adaptation models; Benchmark testing; Context; Predictive models; Training; Tuning; Adaptive sampling; Autotuning; GPU; Heterogeneous computing; Implementation selection; Machine learning; Performance optimization;
Conference_Titel :
Parallel Processing Workshops (ICCPW), 2014 43rd International Conference on
DOI :
10.1109/ICPPW.2014.42