DocumentCode :
1688665
Title :
Outlier detection in performance data of parallel applications
Author :
Benkert, Katharina ; Gabriel, Edgar ; Resch, Michael M.
Author_Institution :
Dept. of Comput. Sci., Univ. of Houston, Houston, TX
fYear :
2008
Firstpage :
1
Lastpage :
8
Abstract :
When an adaptive software component is employed to select the best-performing implementation for a communication operation at runtime, the correctness of the decision taken strongly depends on detecting and removing outliers in the data used for the comparison. This automatic decision is greatly complicated by the fact that the types and quantities of outliers depend on the network interconnect and the nodes assigned to the job by the batch scheduler. This paper evaluates four different statistical methods used for handling outliers, namely a standard interquartile range method, a heuristic derived from the trimmed mean value, cluster analysis and a method using robust statistics. Using performance data from the Abstract Data and Communication Library (ADCL) we evaluate the correctness of the decisions made with each statistical approach over three fundamentally different network interconnects, namely a highly reliable InfiniBand network, a gigabit Ethernet network having a larger variance in the performance, and a hierarchical gigabit Ethernet network.
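As an illustration of the first of the four methods named in the abstract, the following is a minimal sketch of interquartile-range outlier filtering applied before comparing two implementations. It is not taken from the paper or from ADCL: the function names `iqr_filter` and `pick_fastest`, the fence factor `k = 1.5`, the "inclusive" quartile definition, and the use of the mean of the retained samples as the comparison metric are all assumptions made for this example.

```python
import statistics


def iqr_filter(samples, k=1.5):
    """Keep only samples inside [Q1 - k*IQR, Q3 + k*IQR].

    k = 1.5 is the conventional fence factor; the paper's exact
    choice is not stated in the abstract (assumption).
    """
    q1, _, q3 = statistics.quantiles(samples, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [x for x in samples if lower <= x <= upper]


def pick_fastest(timings):
    """Return the implementation name with the lowest mean time
    after outliers have been filtered out.

    `timings` maps a (hypothetical) implementation name to a list
    of measured execution times.
    """
    return min(
        timings,
        key=lambda name: statistics.mean(iqr_filter(timings[name])),
    )


if __name__ == "__main__":
    # Made-up measurements: impl_a is faster but has one large outlier.
    timings = {
        "impl_a": [1.0, 1.1, 0.9, 1.0, 1.05, 9.0],
        "impl_b": [1.2, 1.25, 1.3, 1.2, 1.22, 1.21],
    }
    print(pick_fastest(timings))
```

With the made-up measurements above, comparing raw means would favor `impl_b` because of the single 9.0 sample, whereas filtering first selects `impl_a`. This is the kind of decision error the abstract attributes to unhandled outliers.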
Keywords :
multiprocessor interconnection networks; parallel processing; scheduling; statistical analysis; workstation clusters; InfiniBand network; adaptive software component; batch scheduler; cluster analysis; gigabit Ethernet network; network interconnect; outlier detection; parallel application; robust statistics; standard interquartile range; statistical method; Application software; Context; Ethernet networks; High performance computing; Runtime; Software libraries; Software performance; Statistical analysis; Switches; Telecommunication network reliability; adaptive communication libraries; outlier detection; performance analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
Conference_Location :
Miami, FL
ISSN :
1530-2075
Print_ISBN :
978-1-4244-1693-6
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2008.4536463
Filename :
4536463