• DocumentCode
    1785584
  • Title

    Detection of Silent Data Corruption in fault-tolerant distributed systems on board spacecraft

  • Author

    Fayyaz, Muhammad ; Vladimirova, Tanya

  • Author_Institution
    Dept. of Eng., Univ. of Leicester, Leicester, UK
  • fYear
    2014
  • fDate
    14-17 July 2014
  • Firstpage
    202
  • Lastpage
    209
  • Abstract
    In this paper a novel distributed architecture for system level Fault Detection, Isolation and Recovery (FDIR) aimed at spacecraft applications is presented. The architecture reconfigures itself in the case of a failure for seamless adaptability and operation. Two new algorithms for detection of Silent Data Corruption (SDC) errors are proposed. A selective redundancy method is employed for transient SDC errors, while a distributed mechanism based upon a data signature value is employed for permanent SDC errors. Experimental results based on prototyping with Xilinx Zynq FPGAs are reported, which show that the proposed method is capable of detecting SDC faults in distributed nodes and tolerates node failures by migrating tasks to healthy nodes. Evaluation results show that the proposed SDC detection algorithms achieve very good fault coverage, while using much lower additional resources compared with physical redundancy.
  • Keywords
    data handling; distributed processing; fault diagnosis; fault tolerance; field programmable gate arrays; on-board communications; space vehicles; FDIR; Xilinx Zynq FPGA; distributed architecture; fault detection isolation and recovery; fault-tolerant distributed systems; on-board computing systems; silent data corruption; spacecraft; Fault detection; Fault tolerance; Fault tolerant systems; Hardware; Program processors; Transient analysis; computing; distributed; fault detection; isolation and reconfiguration; onboard; silent data corruption; spacecraft; symptom;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Hardware and Systems (AHS), 2014 NASA/ESA Conference on
  • Conference_Location
    Leicester
  • Type

    conf

  • DOI
    10.1109/AHS.2014.6880178
  • Filename
    6880178