• DocumentCode
    454307
  • Title

    Software-based adaptive and concurrent self-testing in programmable network interfaces

  • Author

    Zhou, Yizheng ; Lakamraju, Vijay ; Koren, Israel ; Krishna, C.M.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Massachusetts Univ., Amherst, MA
  • Volume
    1
  • fYear
    0
  • fDate
    0-0 0
  • Abstract
    Emerging network technologies have complex network interfaces that have renewed concerns about network reliability. In this paper, we present an effective low-overhead failure detection technique, which is based on a software watchdog timer that detects network processor hangs and a self-testing scheme that detects interface failures other than processor hangs. The proposed adaptive and concurrent self-testing scheme achieves failure detection by periodically directing the control flow to go through only active software modules in order to detect errors that affect instructions in the local memory of the network interface. The paper shows how this technique can be made to minimize the performance impact on the host system and be completely transparent to the user
  • Keywords
    computer network reliability; concurrency control; network interfaces; program testing; system recovery; concurrent self-testing; failure detection; programmable network interfaces; software watchdog timer; software-based adaptive self-testing; Built-in self-test; Computer network reliability; Error correction; Fault tolerance; Hardware; Intelligent networks; Logic testing; NASA; Network interfaces; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems, 2006. ICPADS 2006. 12th International Conference on
  • Conference_Location
    Minneapolis, MN
  • ISSN
    1521-9097
  • Print_ISBN
    0-7695-2612-8
  • Type

    conf

  • DOI
    10.1109/ICPADS.2006.101
  • Filename
    1655700