Title :
Fault-tolerant communication over Micronmesh NOC with Micron Message-Passing protocol
Author :
Kariniemi, Heikki ; Nurmi, Jari
Author_Institution :
Dept. of Comput. Syst., Tampere Univ. of Technol., Tampere, Finland
Abstract :
In the future multi-processor system-on-chip (MPSoC) platforms are becoming more vulnerable to transient and intermittent faults due to physical level problems of VLSI technologies. This sets new requirements to the fault-tolerance of the messaging layer software which applications use for communication, because the faults make the operation of the Network-on-Chip (NoC) hardware of the MPSoCs less reliable. This paper presents Micron Message-Passing (MMP) Protocol which is a light-weight protocol designed for improving the fault tolerance of the messaging layer of the MPSoCs where Micronmesh NoC is used. Its fault-tolerance is implemented by watchdog timers and cyclic redundancy checks (CRC) which are usable for detecting packet losses, communication deadlocks, and bit errors. These three functionalities are necessary, because without them the software executed on the MPSoCs is not able to detect the faults and recover from them. This paper presents also how the MMP Protocol can be used for implementing applications which are able to recover from communication faults.
Keywords :
VLSI; fault tolerance; message passing; multiprocessing systems; network-on-chip; protocols; VLSI technology; communication faults; fault-tolerant communication; micron message-passing protocol; micronmesh NOC; multiprocessor system-on-chip platform; network-on-chip; Application software; Cyclic redundancy check; Fault detection; Fault tolerance; Hardware; Network-on-a-chip; Protocols; System-on-a-chip; Telecommunication network reliability; Very large scale integration;
Conference_Titel :
System-on-Chip, 2009. SOC 2009. International Symposium on
Conference_Location :
Tampere
Print_ISBN :
978-1-4244-4465-6
Electronic_ISBN :
978-1-4244-4467-0
DOI :
10.1109/SOCC.2009.5335685