Title :
Automatic Instruction-Level Software-Only Recovery
Author :
Chang, Jonathan ; Reis, George A. ; August, David I.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Princeton Univ., NJ
Abstract :
As chip densities and clock rates increase, processors are becoming more susceptible to transient faults that can affect program correctness. Computer architects have typically addressed reliability issues by adding redundant hardware, but these techniques are often too expensive to be used widely. Software-only reliability techniques have shown promise in their ability to protect against soft-errors without any hardware overhead. However, existing low-level software-only fault tolerance techniques have only addressed the problem of detecting faults, leaving recovery largely unaddressed. In this paper, we present the concept, implementation, and evaluation of automatic, instruction-level, software-only recovery techniques, as well as various specific techniques representing different trade-offs between reliability and performance. Our evaluation shows that these techniques fulfill the promises of instruction-level, software-only fault tolerance by offering a wide range of flexible recovery options
Keywords :
instruction sets; program compilers; program verification; software fault tolerance; system recovery; automatic instruction-level software-only recovery; chip density; clock rates; low-level software-only fault tolerance techniques; program compilers; software-only reliability techniques; Application software; Clocks; Costs; Fault detection; Fault tolerance; Hardware; Microprocessors; Power system reliability; Protection; Redundancy;
Conference_Titel :
Dependable Systems and Networks, 2006. DSN 2006. International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7695-2607-1
DOI :
10.1109/DSN.2006.15