• DocumentCode
    2364261
  • Title

    Fault-tolerant routing in 2D tori or meshes using limited-global-safety information

  • Author

    Xiang, Dong ; Chen, Ai

  • Author_Institution
    Inst. of Microelectron., Tsinghua Univ., Beijing, China
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    231
  • Lastpage
    238
  • Abstract
    A limited-global-safety-information-based metric called local safety is proposed to handle fault-tolerant routing in 2D tori (or meshes). Sufficient conditions for existence of a minimum feasible path between the source and destination is presented based on local safety information in a 2D torus network. An efficient heuristic function is defined to guide fault-tolerant routing inside a 2D torus network. Unlike the conventional methods based on the block fault model, our method does not disable any fault-free nodes and fault-free nodes inside a fault block can still be a source or a destination, which can greatly increase throughput and computational power of the system. Techniques for avoidance of deadlocks are introduced. Extensive simulation results are presented.
  • Keywords
    concurrency control; fault tolerant computing; heuristic programming; multiprocessing systems; 2D torus network; computational power; deadlock avoidance; efficient heuristic function; fault-tolerant routing; limited-global-safety information; meshes; Circuit faults; Fault tolerance; Hypercubes; Microelectronics; Power system modeling; Routing; Safety; Sufficient conditions; System recovery; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing, 2002. Proceedings. International Conference on
  • ISSN
    0190-3918
  • Print_ISBN
    0-7695-1677-7
  • Type

    conf

  • DOI
    10.1109/ICPP.2002.1040878
  • Filename
    1040878