DocumentCode :
1621260
Title :
Detailed design and evaluation of redundant multi-threading alternatives
Author :
Mukherjee, Shubhendu S. ; Kontz, Michael ; Reinhardt, Steven K.
Author_Institution :
Compaq Comput. Corp., Houston, TX, USA
fYear :
2002
fDate :
6/24/1905 12:00:00 AM
Firstpage :
99
Lastpage :
110
Abstract :
Exponential growth in the number of on-chip transistors, coupled. with reductions in. voltage levels, makes each generation of microprocessors increasingly vulnerable to transient faults. In a multi-threaded environment, we can detect these faults by running two copies of the same program as separate threads, feeding them identical inputs, and comparing their outputs, a technique we call redundant multi-threading (RMT). This paper studies RMT techniques in the context of both single- and dual-processor simultaneous multi-threaded (SMT) single-chip devices. Using a detailed, commercial-grade, SMT processor design we uncover subtle RMT implementation complexities, and find that RMT can be a more significant burden for single-processor devices than prior studies indicate. However, a novel application of RMT techniques in a dual-processor device, which we term chip-level redundant threading, shows higher performance than lock-stepping the two cores, especially on multi-threaded workloads
Keywords :
fault diagnosis; fault tolerant computing; multi-threading; parallel architectures; SRT processor; dual-processor device; fault-detection; microprocessors; redundant multithreading; simultaneous redundantly threaded processor; transient fault diagnosis; Degradation; Fault detection; Hardware; Microprocessors; Multithreading; Process design; Surface-mount technology; Transistors; Voltage; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture, 2002. Proceedings. 29th Annual International Symposium on
Conference_Location :
Anchorage, AK
ISSN :
1063-6897
Print_ISBN :
0-7695-1605-X
Type :
conf
DOI :
10.1109/ISCA.2002.1003566
Filename :
1003566
Link To Document :
بازگشت