Title :
UnSync: A Soft Error Resilient Redundant Multicore Architecture
Author :
Jeyapaul, Reiley ; Hong, Fei ; Rhisheekesan, Abhishek ; Shrivastava, Aviral ; Lee, Kyoungwoo
Author_Institution :
Compiler Microarchiteture Lab., Arizona State Univ., Tempe, AZ, USA
Abstract :
Reducing device dimensions, increasing transistor densities, and smaller timing windows, expose the vulnerability of processors to soft errors induced by charge carrying particles. Since these factors are only consequences of the inevitable advancement in processor technology, the industry has been forced to improve reliability on general purpose Chip Multiprocessors (CMPs). With the availability of increased hardware resources, redundancy based techniques are the most promising methods to eradicate soft error failures in CMP systems. In this work, we propose a novel redundant CMP architecture (UnSync) that utilizes hardware based detection mechanisms (most of which are readily available in the processor), to reduce overheads during error free executions. In the presence of errors (which are infrequent), the "always forward execution" enabled recovery mechanism provides for resilience in the system. We design a detailed RTL model of our UnSync architecture and perform hardware synthesis to compare the hardware (power/area) overheads incurred. We compare the same with those of the Reunion technique, a state-of-the-art redundant multi-core architecture. We also perform cycle-accurate simulations over a wide range of SPEC2000, and MiBench benchmarks to evaluate the performance efficiency achieved over that of the Reunion architecture. Experimental results show that, our UnSync architecture reduces power consumption by 34.5% and improves performance by up to 20% with 13.3% less area overhead, when compared to Reunion architecture for the same level of reliability achieved.
Keywords :
computer architecture; failure analysis; fault tolerant computing; microprocessor chips; power aware computing; system recovery; CMP systems; MiBench benchmark; RTL model; Reunion technique; SPEC2000; UnSync architecture; always forward execution enabled recovery mechanism; area overhead; charge carrying particles; cycle-accurate simulation; device dimension reduction; error free execution; general purpose chip multiprocessors; hardware based detection mechanism; hardware resources; hardware synthesis; overhead reduction; performance efficiency; power consumption reduction; power overhead; processor technology; processor vulnerability; redundancy based technique; redundant CMP architecture; soft error failure; soft error resilient redundant multicore architecture; system reliability; system resilience; timing window; transistor density; Computer architecture; Hardware; Instruction sets; Pipelines; Redundancy; core-level redundancy; error resilient; hardware detection; low power; multi-core architecture; redundant architecture; soft error;
Conference_Titel :
Parallel Processing (ICPP), 2011 International Conference on
Conference_Location :
Taipei City
Print_ISBN :
978-1-4577-1336-1
Electronic_ISBN :
0190-3918
DOI :
10.1109/ICPP.2011.76