• DocumentCode
    187048
  • Title
    HardPaxos: Replication Hardened against Hardware Errors
  • Author
    Behrens, Diogo ; Kuvaiskii, Dmitrii ; Fetzer, Christof
  • Author_Institution
    Tech. Univ. Dresden, Dresden, Germany
  • fYear
    2014
  • fDate
    6-9 Oct. 2014
  • Firstpage
    232
  • Lastpage
    241
  • Abstract
    State Machine Replication (SMR) is a common technique to make services fault-tolerant. Practical SMR systems tolerate process crashes, but not hardware errors such as bit flips. Still, hardware errors can cause major service outages, and their rate is expected to increase in the future. Current approaches either incur a high overhead by hardening large parts of the system in software, or increase the cost of ownership by introducing additional hardware components. This work presents HardPaxos, an atomic broadcast algorithm for SMR that enables services to tolerate hardware errors while incurring little performance and state overhead. HardPaxos requires no additional hardware and has only a small part of its functionality hardened using a combination of AN-encoding and duplicated execution. Our evaluation shows a throughput overhead of at most 5% for typical payload sizes. Moreover, fault injection experiments show that our hardening decreases the number of undetected errors from 15% to 0.02%.
  • Keywords
    fault tolerant computing; AN-encoding; HardPaxos; SMR systems; atomic broadcast algorithm; duplicated execution; fault injection; replication hardened against hardware errors; state machine replication; state overhead; Computer crashes; Hardware; Law; Libraries; Proposals; Radiation detectors; Byzantine faults; Paxos; hardware errors;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Reliable Distributed Systems (SRDS), 2014 IEEE 33rd International Symposium on
  • Conference_Location
    Nara
  • Type
    conf
  • DOI
    10.1109/SRDS.2014.13
  • Filename
    6983398
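
The abstract names two hardening techniques, AN-encoding and duplicated execution, without describing them. The following is a minimal, generic sketch of both ideas and is not taken from the HardPaxos implementation. The constant `A` and every function name here are hypothetical choices made for illustration. In AN-encoding a value x is stored as A*x, so any stored word that is not a multiple of A signals corruption. In duplicated execution a computation is run twice and the two results are compared.

```python
# Illustrative sketch only: A and all names below are assumptions,
# not values or APIs from the HardPaxos paper.
A = 58659  # hypothetical odd encoding constant


def an_encode(x: int) -> int:
    """Encode x as the code word A * x."""
    return x * A


def an_check(code: int) -> bool:
    """A valid code word is always a multiple of A."""
    return code % A == 0


def an_decode(code: int) -> int:
    """Decode a code word, refusing to return a corrupted value."""
    if not an_check(code):
        raise ValueError("AN-code check failed: possible hardware error")
    return code // A


def an_add(a: int, b: int) -> int:
    """Add two encoded values; A*x + A*y = A*(x + y) is still a code word."""
    return a + b


def flip_bit(value: int, bit: int) -> int:
    """Simulate a single bit flip at the given bit position."""
    return value ^ (1 << bit)


def duplicated(fn, *args):
    """Run fn twice and compare the results to catch a transient error."""
    first = fn(*args)
    second = fn(*args)
    if first != second:
        raise RuntimeError("duplicated execution mismatch")
    return first


if __name__ == "__main__":
    total = an_add(an_encode(20), an_encode(22))
    print(an_decode(total))                # decoded sum of 20 and 22
    print(an_check(flip_bit(total, 3)))    # a flipped bit breaks the check
    print(duplicated(lambda a, b: a + b, 2, 3))
```

Because `A` is odd and greater than 1, a single bit flip changes a code word by plus or minus a power of two, which is never a multiple of `A`, so this sketch detects every single-bit flip. Multi-bit errors are detected only with some probability that depends on the choice of `A`.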