• DocumentCode
    1828326
  • Title

    Towards the Robustness of Dynamic Loop Scheduling on Large-Scale Heterogeneous Distributed Systems

  • Author

    Banicescu, Ioana ; Ciorba, Florina M. ; Cariño, Ricolindo L.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Mississippi State Univ., Starkville, MS, USA
  • fYear
    2009
  • fDate
    June 30 2009-July 4 2009
  • Firstpage
    129
  • Lastpage
    132
  • Abstract
    Dynamic loop scheduling (DLS) algorithms provide application-level load balancing of loop iterates, with the goal of maximizing application performance on the underlying system. These methods use run-time information regarding the performance of the application´s execution (for which irregularities change over time). Many DLS methods are based on probabilistic analyses, and therefore account for unpredictable variations of application and system related parameters. Scheduling scientific and engineering applications in large-scale distributed systems (possibly shared with other users) makes the problem of DLS even more challenging. Moreover, the chances of failure, such as processor or link failure, are high in such large-scale systems. In this paper, we employ the hierarchical approach for three DLS methods, and propose metrics for quantifying their robustness with respect to variations of two parameters (load and processor failures), for scheduling irregular applications in large-scale heterogeneous distributed systems.
  • Keywords
    distributed processing; probability; program control structures; resource allocation; scheduling; application-level load balancing; dynamic loop scheduling; engineering application scheduling; hierarchical approach; large-scale heterogeneous distributed systems; loop iterates; probabilistic analyses; run-time information; scientific application scheduling; Computational modeling; Computer science; Distributed computing; Dynamic scheduling; Large-scale systems; Load management; Processor scheduling; Robustness; Runtime; Vehicle dynamics; dynamic load balancing; irregular tasks; processor failures; robustness metrics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Computing, 2009. ISPDC '09. Eighth International Symposium on
  • Conference_Location
    Lisbon
  • Print_ISBN
    978-0-7695-3680-4
  • Type

    conf

  • DOI
    10.1109/ISPDC.2009.39
  • Filename
    5284360