DocumentCode
454307
Title
Software-based adaptive and concurrent self-testing in programmable network interfaces
Author
Zhou, Yizheng ; Lakamraju, Vijay ; Koren, Israel ; Krishna, C.M.
Author_Institution
Dept. of Electr. & Comput. Eng., Massachusetts Univ., Amherst, MA
Volume
1
fYear
0
fDate
0-0 0
Abstract
Emerging network technologies have complex network interfaces that have renewed concerns about network reliability. In this paper, we present an effective low-overhead failure detection technique, which is based on a software watchdog timer that detects network processor hangs and a self-testing scheme that detects interface failures other than processor hangs. The proposed adaptive and concurrent self-testing scheme achieves failure detection by periodically directing the control flow to go through only active software modules in order to detect errors that affect instructions in the local memory of the network interface. The paper shows how this technique can be made to minimize the performance impact on the host system and be completely transparent to the user
Keywords
computer network reliability; concurrency control; network interfaces; program testing; system recovery; concurrent self-testing; failure detection; programmable network interfaces; software watchdog timer; software-based adaptive self-testing; Built-in self-test; Computer network reliability; Error correction; Fault tolerance; Hardware; Intelligent networks; Logic testing; NASA; Network interfaces; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems, 2006. ICPADS 2006. 12th International Conference on
Conference_Location
Minneapolis, MN
ISSN
1521-9097
Print_ISBN
0-7695-2612-8
Type
conf
DOI
10.1109/ICPADS.2006.101
Filename
1655700
Link To Document