DocumentCode
1828326
Title
Towards the Robustness of Dynamic Loop Scheduling on Large-Scale Heterogeneous Distributed Systems
Author
Banicescu, Ioana ; Ciorba, Florina M. ; Cariño, Ricolindo L.
Author_Institution
Dept. of Comput. Sci. & Eng., Mississippi State Univ., Starkville, MS, USA
fYear
2009
fDate
June 30 2009-July 4 2009
Firstpage
129
Lastpage
132
Abstract
Dynamic loop scheduling (DLS) algorithms provide application-level load balancing of loop iterates, with the goal of maximizing application performance on the underlying system. These methods use run-time information regarding the performance of the application´s execution (for which irregularities change over time). Many DLS methods are based on probabilistic analyses, and therefore account for unpredictable variations of application and system related parameters. Scheduling scientific and engineering applications in large-scale distributed systems (possibly shared with other users) makes the problem of DLS even more challenging. Moreover, the chances of failure, such as processor or link failure, are high in such large-scale systems. In this paper, we employ the hierarchical approach for three DLS methods, and propose metrics for quantifying their robustness with respect to variations of two parameters (load and processor failures), for scheduling irregular applications in large-scale heterogeneous distributed systems.
Keywords
distributed processing; probability; program control structures; resource allocation; scheduling; application-level load balancing; dynamic loop scheduling; engineering application scheduling; hierarchical approach; large-scale heterogeneous distributed systems; loop iterates; probabilistic analyses; run-time information; scientific application scheduling; Computational modeling; Computer science; Distributed computing; Dynamic scheduling; Large-scale systems; Load management; Processor scheduling; Robustness; Runtime; Vehicle dynamics; dynamic load balancing; irregular tasks; processor failures; robustness metrics;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Computing, 2009. ISPDC '09. Eighth International Symposium on
Conference_Location
Lisbon
Print_ISBN
978-0-7695-3680-4
Type
conf
DOI
10.1109/ISPDC.2009.39
Filename
5284360
Link To Document