• DocumentCode
    2483193
  • Title

    A multiprocessor scheduling algorithm for low overhead fault-tolerance

  • Author

    Hashimoto, Koji ; Tsuchiya, Tatsuhiro ; Kikuno, Tohru

  • Author_Institution
    Dept. of Inf. & Math. Sci., Osaka Univ., Japan
  • fYear
    1998
  • fDate
    20-23 Oct 1998
  • Firstpage
    186
  • Lastpage
    194
  • Abstract
    We propose a new scheduling algorithm for achieving fault tolerance in multiprocessor systems. The new algorithm partitions a parallel program into subsets of tasks based on some characteristics of a task graph. Then for each subset, the algorithm duplicates and schedules its tasks successively. Applying the proposed algorithm to three kinds of practical task graphs (Gaussian elimination, Laplace equation solver and LU decomposition), we conduct simulations. Experimental results show that fault tolerance can be achieved at the cost of a small degree of time redundancy, and that performance in the case of a processor failure is improved compared to a previous algorithm
  • Keywords
    multiprocessing systems; parallel programming; processor scheduling; software fault tolerance; Gaussian elimination; LU decomposition; Laplace equation solver; low overhead fault tolerance; multiprocessor scheduling algorithm; multiprocessor systems; parallel program partitioning; processor failure; task graph; task subsets; time redundancy; Fault tolerance; Laplace equations; Processor scheduling; Scheduling algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems, 1998. Proceedings. Seventeenth IEEE Symposium on
  • Conference_Location
    West Lafayette, IN
  • ISSN
    1060-9857
  • Print_ISBN
    0-8186-9218-9
  • Type

    conf

  • DOI
    10.1109/RELDIS.1998.740493
  • Filename
    740493