DocumentCode
692933
Title
Predicting application performance using supervised learning on communication features
Author
Jain, Nikhil ; Bhatele, Abhinav ; Robson, Michael P. ; Gamblin, Todd ; Kale, Laxmikant V.
Author_Institution
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear
2013
fDate
17-22 Nov. 2013
Firstpage
1
Lastpage
12
Abstract
Task mapping on torus networks has traditionally focused on either reducing the maximum dilation or average number of hops per byte for messages in an application. These metrics make simplified assumptions about the cause of network congestion, and do not provide accurate correlation with execution time. Hence, these metrics cannot be used to reasonably predict or compare application performance for different mappings. In this paper, we attempt to model the performance of an application using communication data, such as the communication graph and network hardware counters. We use supervised learning algorithms, such as randomized decision trees, to correlate performance with prior and new metrics. We propose new hybrid metrics that provide high correlation with application performance, and may be useful for accurate performance prediction. For three different communication patterns and a production application, we demonstrate a very strong correlation between the proposed metrics and the execution time of these codes.
Keywords
decision trees; learning (artificial intelligence); multiprocessing systems; multiprocessor interconnection networks; performance evaluation; application performance prediction; communication data; communication features; communication graph; communication patterns; network congestion; network hardware counters; production application; randomized decision trees; supervised learning algorithms; task mapping; torus networks; Benchmark testing; Correlation; Decision trees; Hardware; Measurement; Predictive models; Radiation detectors; contention; modeling; prediction; supervised learning; task mapping; torus networks;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2013 International Conference for
Conference_Location
Denver, CO
Print_ISBN
978-1-4503-2378-9
Type
conf
DOI
10.1145/2503210.2503263
Filename
6877528
Link To Document