• DocumentCode
    2330913
  • Title

    Fault tolerant message routing on large parallel systems

  • Author

    Gordon, Jesse M. ; Stout, Quentin F.

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
  • fYear
    1988
  • fDate
    10-12 Oct 1988
  • Firstpage
    155
  • Lastpage
    158
  • Abstract
    The problem of designing massively fault-tolerant message routing schemes for large parallel systems is considered. The notion of faults is extremely flexible and applies to all situations where a component is unavailable to participate in message communications. Attention is focused on the performance of schemes which use only local information to make local decisions. A framework for the analysis of fault-tolerant routing schemes is presented and used to analyze the efficacy of minimal path routing methods. Fault-tolerant routing schemes are derived by application of a technique called sidetracking. Viewed as making local decisions, a sidetracking scheme attempts to decrease the distance to the destination: if this is not possible, then the packet is routed randomly so as to increase the distance as little as possible. For single-message routing on a hypercube, it is shown that the performance of a sidetracking scheme is near optimal, successfully routing with high probability and low average excess delay. Applications of the sidetracking technique to single-message routing on a two-dimensional mesh and to multiple-message permutation routing on a hypercube are presented
  • Keywords
    fault tolerant computing; parallel processing; fault tolerant message routing; hypercube; large parallel systems; local decisions; local information; message communications; minimal path routing; multiple-message permutation routing; performance; sidetracking; single-message routing; two-dimensional mesh; Concurrent computing; Context; Delay; Distributed computing; Fault diagnosis; Fault tolerance; Fault tolerant systems; Hypercubes; Routing; Switched systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers of Massively Parallel Computation, 1988. Proceedings., 2nd Symposium on the Frontiers of
  • Conference_Location
    Fairfax, VA
  • Print_ISBN
    0-8186-5892-4
  • Type

    conf

  • DOI
    10.1109/FMPC.1988.47464
  • Filename
    47464