Title :
Application fault tolerance with Armor middleware
Author :
Kalbarczyk, Zbigniew ; Iyer, Ravishankar K. ; Wang, Long
Author_Institution :
Illinois Univ., Urbana, IL, USA
Abstract :
Many current approaches to software-implemented fault tolerance (SIFT) rely on process replication, which is often prohibitively expensive for practical use due to its high performance overhead and cost. The adaptive reconfigurable mobile objects of reliability (Armor) middleware architecture offers a scalable low-overhead way to provide high-dependability services to applications. It uses coordinated multithreaded processes to manage redundant resources across interconnected nodes, detect errors in user applications and infrastructural components, and provide failure recovery. The authors describe the experiences and lessons learned in deploying Armor in several diverse fields.
Keywords :
computer network reliability; middleware; multi-threading; reconfigurable architectures; redundancy; software fault tolerance; software performance evaluation; system recovery; Armor middleware; adaptive reconfigurable mobile objects of reliability; coordinated multithreaded process; failure recovery; process replication; software-implemented fault tolerance rely; Application software; Computer architecture; Costs; Fault tolerance; Fault tolerant systems; Hardware; Middleware; Mobile computing; Resource management; Yarn; failure recovery; high-dependability services; middleware; software-implemented fault tolerance;
Journal_Title :
Internet Computing, IEEE
DOI :
10.1109/MIC.2005.31