Title :
A Codesigned Fault Tolerance System for Heterogeneous Many-Core Processors
Author :
Yim, Keun Soo ; Iyer, Ravishankar K.
Author_Institution :
Coordinated Sci. Lab., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Abstract :
This paper presents an efficient fault tolerance system for heterogeneous many-core processors. The efficiencies and coverage of the presented fault tolerance are optimized by customizing the techniques for different types of components in the highest layers of system abstractions and codesigning the techniques in a way that separates algorithms and mechanisms.
Keywords :
fault tolerant computing; multiprocessing systems; codesigned fault tolerance system; heterogeneous many-core processors; system abstraction; Cloning; Detectors; Fault tolerance; Fault tolerant systems; Hardware; Program processors;
Conference_Titel :
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-425-1
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2011.375