Title :
SoftBeam: Precise tracking of transient faults and vulnerability analysis at processor design time
Author :
Gschwind, Michael ; Salapura, Valentina ; Trammell, Catherine ; McKee, Sally A.
Author_Institution :
IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
To study system reliability of a next-generation system, we undertake a soft error vulnerability study for a next-generation microprocessor design. Starting from design data for the entire processor, we extend the microprocessor verification methodology to study soft error propagation through microprocessor logic into the architected processor state. We use soft error injection into randomly selected latch bits to (1) identify areas for improvement, (2) derate technology susceptibility by architectural, microarchitectural, and logic masking resulting in increased soft error resilience; and (3) identify areas where microarchitectural data corruption can be tolerated as performance degradation without impact on correctness, yielding even greater soft error resilience. Based on these results, we reduce design vulnerability to soft errors by factors ranging from 2 for an execution unit to more than 32 for a memory management unit.
Keywords :
microprocessor chips; reliability; SoftBeam; architected processor state; logic masking; memory management unit; microarchitectural data corruption; microprocessor logic; microprocessor verification methodology; next-generation microprocessor design; next-generation system; performance degradation; processor design time; soft error injection; soft error propagation; soft error resilience; soft error vulnerability; system reliability; technology susceptibility; transient faults; vulnerability analysis; Circuit faults; Clocks; Kernel; Latches; Microprocessors; Resilience; Transient analysis;
Conference_Titel :
Computer Design (ICCD), 2011 IEEE 29th International Conference on
Conference_Location :
Amherst, MA
Print_ISBN :
978-1-4577-1953-0
DOI :
10.1109/ICCD.2011.6081430