• DocumentCode
    1267438
  • Title

    Matching and scheduling algorithms for minimizing execution time and failure probability of applications in heterogeneous computing

  • Author

    Dogan, Atakan ; Ozguner, Füsün

  • Author_Institution
    Dept. of Electr. Eng., Ohio State Univ., Columbus, OH, USA
  • Volume
    13
  • Issue
    3
  • fYear
    2002
  • fDate
    3/1/2002 12:00:00 AM
  • Firstpage
    308
  • Lastpage
    323
  • Abstract
    In a heterogeneous distributed computing system, machine and network failures are inevitable and can have an adverse effect on applications executing on the system. To reduce the effect of failures on an application executing on a failure-prone system, matching and scheduling algorithms which minimize not only the execution time but also the probability of failure of the application must be devised. However, because of the conflicting requirements, it is not possible to minimize both of the objectives at the same time. Thus, the goal of this paper is to develop matching and scheduling algorithms which account for both the execution time and the reliability of the application. This goal is achieved by modifying an existing matching and scheduling algorithm. The reliability of resources is taken into account using an incremental cost function proposed in this paper and the new algorithm is referred to as the reliable dynamic level scheduling algorithm. The incremental cost function can be defined based on one of the three cost functions developed here. These cost functions are unique in the sense that they are not restricted to tree-based networks and a specific matching and scheduling algorithm. The simulation results confirm that the proposed incremental cost function can be incorporated into matching and scheduling algorithms to produce schedules where the effect of failures of machines and network resources on the execution of the application is reduced and the execution time of the application is minimized as well
  • Keywords
    fault tolerant computing; probability; processor scheduling; workstation clusters; articulation points; cost functions; dynamic level scheduling algorithm; failure-prone system; heterogeneous distributed computing system; incremental cost function; machine failures; matching algorithms; network failures; scheduling algorithms; Scheduling algorithm;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.993209
  • Filename
    993209