Title :
OE+IOE: A novel turn model based fault tolerant routing scheme for networks-on-chip
Author :
Pasricha, Sudeep ; Zou, Yong ; Connors, Dan ; Siegel, Howard Jay
Author_Institution :
Dept. of Electr. & Comput. Eng., Colorado State Univ., Fort Collins, CO, USA
Abstract :
Network-on-chip (NoC) communication architectures are increasingly being used today to interconnect cores on chip multiprocessors (CMPs). Permanent faults in NoCs due to fabrication challenges in ultra deep submicron (UDSM) technology nodes and due to wearout have led to an increased emphasis on fault tolerant design techniques. To ensure fault tolerant communication in NoCs, several fault tolerant routing algorithms have been proposed in recent years with the goal of routing flits around faults. A majority of these algorithms are based on the turn model approach due to its simplicity and inherent freedom from deadlock. However, existing turn model based fault tolerant routing algorithms are either too restrictive in the choice of paths that flits can traverse, or are tailored to work efficiently only on very specific fault distribution patterns. In this paper, we propose a novel low overhead fault tolerant routing scheme that combines the odd-even (OE) and inverted odd-even (IOE) turn models to achieve much better fault tolerance than traditional turn model based schemes. The proposed scheme uses replication opportunistically to optimize the balance between energy overhead and arrival rate. Our experimental results indicate that the proposed OE+IOE routing scheme provides better fault tolerance than existing turn model, N-random walk, and dual virtual channel based routing schemes that have been proposed in literature.
Keywords :
fault tolerance; microprocessor chips; multiprocessing systems; multiprocessor interconnection networks; network-on-chip; N-random walk; OE+IOE routing scheme; dual virtual channel based routing scheme; fault distribution pattern; fault tolerant routing scheme; interconnect core; inverted odd-even turn model; network-on-chip communication architecture; on chip multiprocessor; permanent fault; turn model approach; ultra deep submicron technology; Circuit faults; Fault tolerant systems; Redundancy; Routing; Runtime; System recovery; Fault-tolerant routing; Networks-on-chip;
Conference_Titel :
Hardware/Software Codesign and System Synthesis (CODES+ISSS), 2010 IEEE/ACM/IFIP International Conference on
Conference_Location :
Scottsdale, AZ
Print_ISBN :
978-1-6055-8905-3