DocumentCode :
918037
Title :
Software-Based Failure Detection and Recovery in Programmable Network Interfaces
Author :
Zhou, Yizheng ; Lakamraju, Vijay ; Koren, Israel ; Krishna, C. Mani
Author_Institution :
Univ. of Massachusetts, Amherst
Volume :
18
Issue :
11
fYear :
2007
Firstpage :
1539
Lastpage :
1550
Abstract :
Emerging network technologies have complex network interfaces that have renewed concerns about network reliability. In this paper, we present an effective low-overhead fault tolerance technique to recover from network interface failures. Failure detection is based on a software watchdog timer that detects network processor hangs and a self-testing scheme that detects interface failures other than processor hangs. The proposed self-testing scheme achieves failure detection by periodically directing the control flow to go through only active software modules in order to detect errors that affect instructions in the local memory of the network interface. Our failure recovery is achieved by restoring the state of the network interface using a small backup copy containing just the right amount of information required for complete recovery. The paper shows how this technique can be made to minimize the performance impact to the host system and be completely transparent to the user.
Keywords :
built-in self test; fault diagnosis; fault tolerant computing; logic testing; network interfaces; built-in self-test; failure recovery; fault tolerance; network reliability; programmable network interface; software-based failure detection; Built-in self-test; Complex networks; Computer errors; Error correction; Fault tolerance; Hardware; Logic testing; Network interfaces; Single event upset; System testing; Programmable Network Interface Card (NIC); Single Event Upset (SEU); failure detection; radiation induced faults; self-testing;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2007.1093
Filename :
4339198
Link To Document :
بازگشت