DocumentCode
238566
Title
Robustness Prediction and Evaluation of Divisible Load Scheduling on Computing Systems with Unpredictable Variations
Author
Balasubramaniam, M. ; Banicescu, Ioana ; Ciorba, Florina M.
Author_Institution
Dept. of Comput. Sci. & Eng., Mississippi State Univ., Starkville, MS, USA
fYear
2014
fDate
24-27 June 2014
Firstpage
170
Lastpage
177
Abstract
This work addresses the problem of predicting and evaluating the robustness of divisible load scheduling of data parallel workloads (also called arbitrarily divisible workloads) onto high performance parallel and distributed computing systems with unpredictable variations. Divisible load scheduling is based on the divisible load theory (DLT) which offers a linear, deterministic, and tractable model for scheduling arbitrarily divisible workloads. High performance parallel and distributed computing systems operate in an environment characterized by unpredictable variations (or perturbations) such as system load or unexpected resource failures. In this work, we analytically evaluate and empirically determine the robustness of divisible load scheduling algorithms (called DLT algorithms) with respect to variations in processor availability via realistic simulation. The realism arises from modeling the characteristics of two applications from the NAS parallel benchmark suite, as well as from modeling the target system as a 3D torus topology, one of the most widely used interconnection networks. Extending prior related work, we conduct an analytical evaluation as well as a simulation-based study of the robustness of divisible load scheduling for scheduling the two NAS parallel benchmarks, namely, embarrassingly parallel (EP) which is computationally intensive and integer sort (IS) which is communication intensive. The simulation results indicate that the robustness observed via simulation of scheduling the EP benchmark is always within the analytically predicted range, and that the robustness observed via simulation of scheduling the IS benchmark is within the analytically predicted range in the best case, and within 6.56% on average in the worst case.
Keywords
parallel processing; scheduling; 3D torus topology; DLT; DLT algorithm; NAS parallel benchmark suite; data parallel workload; distributed computing systems; divisible load scheduling algorithm; divisible load theory; embarrassingly parallel benchmark; high performance parallel computing systems; integer sort benchmark; robustness evaluation; robustness prediction; Benchmark testing; Computational modeling; Load modeling; Processor scheduling; Robustness; Runtime; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Computing (ISPDC), 2014 IEEE 13th International Symposium on
Conference_Location
Marseilles
Print_ISBN
978-1-4799-5918-1
Type
conf
DOI
10.1109/ISPDC.2014.26
Filename
6900216
Link To Document