DocumentCode :
3653741
Title :
A multi-layer software-based fault-tolerance approach for heterogenous multi-core systems
Author :
S. M?ller;T. Koal;S. Scharoba;H.T. Vierhaus;M. Sch?lzel
Author_Institution :
Brandenburg University of Technology, Cottbus, Germany
fYear :
2015
fDate :
3/1/2015 12:00:00 AM
Firstpage :
1
Lastpage :
6
Abstract :
This paper describes a software-based technique for building heterogeneous fault tolerant multi-core systems, which are able to handle temporary and permanent hardware faults autonomously in two system layers. The fault tolerance technique relies on a single concept for adapting the binary code of the user application to the current fault state of a single core. Thereby this scheme is used either for a local repair of each core or for a global repair. By the global repair, the task assigned to a faulty core may be rescheduled to another core that provides enough resources for the execution of the task. Thereby the local repair scheme is reused for the adaptation of the rescheduled task. It is shown that the reliability of a multi-core system can be improved significantly, when using the global repair together with the local repair instead of using the local repair only.
Keywords :
"Maintenance engineering","Registers","Multicore processing","Redundancy","Hardware","Binary codes","Multiprocessor interconnection"
Publisher :
ieee
Conference_Titel :
Test Symposium (LATS), 2015 16th Latin-American
ISSN :
2373-0862
Type :
conf
DOI :
10.1109/LATW.2015.7102508
Filename :
7102508
Link To Document :
بازگشت