• DocumentCode
    5212
  • Title

    Low-overhead fault-tolerance technique for a dynamically reconfigurable softcore processor

  • Author

    Hung-Manh Pham ; Pillement, Sebastien ; Piestrak, Stanislaw J.

  • Author_Institution
    VNPT Technol. JSC, Hanoi, Vietnam
  • Volume
    62
  • Issue
    6
  • fYear
    2013
  • fDate
    Jun-13
  • Firstpage
    1179
  • Lastpage
    1192
  • Abstract
    In this paper, we propose a new approach to implement a reliable softcore processor on SRAM-based FPGAs, which can mitigate radiation-induced temporary faults (single-event upsets (SEUs)) at moderate cost. A new Enhanced Lockstep scheme built using a pair of MicroBlaze cores is proposed and implemented on Xilinx Virtex-5 FPGA. Unlike the basic lockstep scheme, ours allows to detect and eliminate its internal temporary configuration upsets without interrupting normal functioning. Faults are detected and eliminated using a Configuration Engine built on the basis of the PicoBlaze core which, to avoid a single point of failure, is implemented as fault-tolerant using triple modular redundancy (TMR). A softcore processor can recover from configuration upsets through partial reconfiguration combined with roll-forward recovery. SEUs affecting logic which are significantly less likely than those affecting configuration are handled by checkpointing and rollback. Finally, to handle permanent faults, the tiling technique is also proposed. The new Enhanced Lockstep scheme requires significantly shorter error recovery time compared to conventional lockstep scheme and uses significantly smaller number of slices compared to known TMR-based design (although at the cost of longer error recovery time). The efficiency of the proposed approach was validated through fault injection experiments.
  • Keywords
    SRAM chips; fault diagnosis; fault tolerant computing; field programmable gate arrays; integrated circuit reliability; logic design; microprocessor chips; radiation effects; reconfigurable architectures; redundancy; system recovery; MicroBlaze cores; PicoBlaze core; SEU; SRAM-based FPGA; TMR; Xilinx Virtex-5 FPGA; checkpointing; configuration engine; configuration upsets; dynamically reconfigurable softcore processor; enhanced lockstep scheme; error recovery time; fault detection; fault elimination; internal temporary configuration upset detection; internal temporary configuration upset elimination; low-overhead fault tolerance technique; partial reconfiguration; permanent fault handling; radiation-induced temporary fault mitigation; roll-forward recovery; rollback; single-event upset; softcore processor reliability; tiling technique; triple modular redundancy; Context; Fault tolerance; Fault tolerant systems; Field programmable gate arrays; Hardware; Random access memory; Tunneling magnetoresistance; Error recovery; FPGA; fault injection; fault-tolerance; lockstep; reconfigurable system; single-event upset (SEU); softcore processor;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2012.55
  • Filename
    6158641