• DocumentCode
    1884821
  • Title

    A fault-tolerant programmable voter for software-based N-modular redundancy

  • Author

    Yim, Keun Soo ; Sidea, Valentin ; Kalbarczyk, Zbigniew ; Chen, Deming ; Iyer, Ravishankar K.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2012
  • fDate
    3-10 March 2012
  • Firstpage
    1
  • Lastpage
    20
  • Abstract
    This paper presents a fault-tolerant, programmable voter architecture for software-implemented N-tuple modular redundant (NMR) computer systems. Software NMR is a cost-efficient solution for high-performance, mission-critical computer systems because this can be built on top of commercial off-the-shelf (COTS) devices. Due to the large volume and randomness of voting data, software NMR system requires a programmable voter. Our experiment shows that voting software that executes on a processor has the time-of-check-to-time-of-use (TOCTTOU) vulnerabilities and is unable to tolerate long duration faults. In order to address these two problems, we present a special-purpose voter processor and its embedded software architecture. The processor has a set of new instructions and hardware modules that are used by the software in order to accelerate the voting software execution and address the identified two reliability problems. We have implemented the presented system on an FPGA platform. Our evaluation result shows that using the presented system reduces the execution time of error detection codes (commonly used in voting software) by 14% and their code size by 56%. Our fault injection experiments validate that the presented system removes the TOCTTOU vulnerabilities and recovers under both transient and long duration faults. This is achieved by using 0.7% extra hardware in a baseline processor.
  • Keywords
    embedded systems; fault tolerant computing; field programmable gate arrays; redundancy; software architecture; software reliability; FPGA platform; commercial off-the-shelf devices; embedded software architecture; error detection codes; fault injection experiments; fault-tolerant programmable voter architecture; high-performance mission-critical computer systems; reliability problems; software-based N-modular redundancy; software-implemented N-tuple modular redundant computer systems; special-purpose voter processor; time-of-check-to-time-of-use vulnerabilities; voting software execution; Computer architecture; Hardware; Nuclear magnetic resonance; Redundancy; Software; Software reliability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Aerospace Conference, 2012 IEEE
  • Conference_Location
    Big Sky, MT
  • ISSN
    1095-323X
  • Print_ISBN
    978-1-4577-0556-4
  • Type

    conf

  • DOI
    10.1109/AERO.2012.6187253
  • Filename
    6187253