Title :
Memory controller for globally uncoordinated and locally coordinated checkpointing
Author :
Congal, Yoann ; Cartron, Mickael
Author_Institution :
Lab. de Fiabilisation des Syst. Embarques, CEA, Gif-sur-Yvette, France
Abstract :
In order to deal with soft errors that affect the logic of multiprocessors, a new architecture with backward error recovery capability is proposed. The proposed architecture uses the fault containment capability offered by memory protection units for reducing the overhead due to the checkpointing information management. A new hybrid checkpointing scheme is proposed that aims at performing coordinated checkpointing on logic data sets that are built on software ensembles that logically interact. A new memory controller able to manage the checkpointing information is proposed for this architecture. It has been designed for minimizing the performance overhead for error-free scenarios. The performance and memory overhead have been evaluated for different applications.
Keywords :
CMOS logic circuits; microprocessor chips; error-free; fault containment capability; global uncoordinated checkpointing; local coordinated checkpointing; logic data sets; memory controller; multiprocessors; software; Bit error rate; Checkpointing; Computer architecture; Context; Program processors; Robustness;
Conference_Titel :
Electronics, Circuits and Systems (ICECS), 2011 18th IEEE International Conference on
Conference_Location :
Beirut
Print_ISBN :
978-1-4577-1845-8
Electronic_ISBN :
978-1-4577-1844-1
DOI :
10.1109/ICECS.2011.6122327