DocumentCode
3678345
Title
A Machine-Learning Approach for Communication Prediction of Large-Scale Applications
Author
Nikela Papadopoulou;Georgios Goumas;Nectarios Koziris
Author_Institution
Sch. of Electr. &
fYear
2015
Firstpage
120
Lastpage
123
Abstract
In this paper we present a machine-learning approach to predict the total communication time of parallel applications. Communication time is heavily dependent on a very wide set of parameters relevant to the architecture, runtime configuration and application communication profile. We focus our study on parameters that can be easily extracted from the application and the process mapping ahead of execution. To this direction we define a small set of descriptive metrics and build a simple benchmark that can sweep over the parameter space in a straightforward way. We use this benchmarking data to train a robust multiple variable regression model which serves as our communication predictor. Our experimental results show notable accuracy in predicting the communication time of two indicative application kernels on a supercomputer utilizing from a few dozen to a few thousands processing cores.
Keywords
"Measurement","Benchmark testing","Resource management","Predictive models","Correlation","Mathematical model","Kernel"
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/CLUSTER.2015.27
Filename
7307574
Link To Document