Title :
Towards Byzantine Fault Tolerance in Many-Core Computing Platforms
Author :
Jeffery, Casey M. ; Figueiredo, Renato J O
Author_Institution :
Univ. of Florida, Gainesville
Abstract :
This paper presents a flexible technique that can be applied to many-core architectures to exploit idle resources and ensure reliable system operation. A dynamic fault tolerance layer is interposed between the hardware and OS through the use of a hypervisor. The introduction of a single point of failure is avoided by incorporating the hypervisor into the sphere of replication. This approach simplifies implementation over specialized hardware- or OS-based techniques while offering flexibility in the level of protection provided ranging from duplex to Byzantine protection. The feasibility of the approach is considered for both near- and long-term computing platforms.
Keywords :
computer architecture; fault tolerance; microprocessor chips; operating systems (computers); Byzantine fault tolerance; OS-based techniques; hypervisor; many-core architectures; many-core computing platforms; Computer architecture; Fault tolerance; Fault tolerant systems; Frequency; Hardware; Information systems; Laboratories; Network-on-a-chip; Protection; Virtual machine monitors;
Conference_Titel :
Dependable Computing, 2007. PRDC 2007. 13th Pacific Rim International Symposium on
Conference_Location :
Melbourne, Qld.
Print_ISBN :
0-7695-3054-0
DOI :
10.1109/PRDC.2007.40