Title :
The soft error problem: an architectural perspective
Author :
Mukherjee, Shubhendu S. ; Emer, Joel ; Reinhardt, Steven K.
Author_Institution :
FACT Group, Intel Corp., Hudson, MA, USA
Abstract :
Radiation-induced soft errors have emerged as a key challenge in computer system design. If the industry is to continue to provide customers with the level of reliability they expect, microprocessor architects must address this challenge directly. This effort has two parts. First, architects must understand the impact of soft errors on their designs. Second, they must select judiciously from among available techniques to reduce this impact in order to meet their reliability targets with minimum overhead. To provide a foundation for these efforts, this paper gives a broad overview of the soft error problem from an architectural perspective. We start with basic definitions, followed by a description of techniques to compute the soft error rate. Then, we summarize techniques used to reduce the soft error rate. This paper also describes problems with double-bit errors. Finally, this paper outlines future directions for architecture research in soft errors.
Keywords :
computer architecture; microprocessor chips; system recovery; computer architecture; computer system design; double-bit errors; radiation-induced soft errors; soft error problem; soft error rate; Additives; Alpha particles; Computer architecture; Computer errors; Error analysis; Error correction; Particle measurements; Pollution measurement; Semiconductor device measurement; Testing;
Conference_Titel :
High-Performance Computer Architecture, 2005. HPCA-11. 11th International Symposium on
Print_ISBN :
0-7695-2275-0
DOI :
10.1109/HPCA.2005.37