DocumentCode :
2928722
Title :
Designing Reliable Architecture for Stateful Fault Tolerance
Author :
Saha, Indranil ; Mukhopadhyay, Debapriyay ; Banerjee, Satyajit
Author_Institution :
Honeywell Technol. Solutions Lab., Bangalore
fYear :
2006
fDate :
Dec. 2006
Firstpage :
545
Lastpage :
551
Abstract :
Performance and fault tolerance are two major issues that need to be addressed while designing highly available and reliable systems. The network topology or the notion of connectedness among the network nodes defines the system communication architecture and is an important design consideration for fault tolerant systems. A number of fault tolerant designs for specific multi-processor architecture exists in the literature, but none of them discriminates between stateless and stateful failover. In this paper, we propose a reliable network topology and a high availability framework which is tolerant up to a maximum of k node faults in a network and is designed specifically to meet the needs of stateful failover. Assuming the nodes in the network are capable of handling multiple processes, through our design we have been able to prove that in the event of k node failures the load can be uniformly distributed across the network - ensuring load balance. We also provide an useful characterization for the network, which under the proposed framework ensures one hop communication between the required nodes
Keywords :
computer network reliability; fault tolerant computing; multiprocessing systems; resource allocation; fault tolerant designs; fault tolerant systems; highly available systems; highly reliable systems; load balancing; multiprocessor architecture; network topology; one hop communication; reliable architecture design; stateful failover; stateful fault tolerance; system communication architecture; Application software; Availability; Costs; Fault tolerance; Fault tolerant systems; Hardware; Mission critical systems; Network topology; Redundancy; Telecommunication network reliability; Harary Graph; Load Balance.; Network Topology; Stateful Failover; k-Fault Tolerance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies, 2006. PDCAT '06. Seventh International Conference on
Conference_Location :
Taipei
Print_ISBN :
0-7695-2736-1
Type :
conf
DOI :
10.1109/PDCAT.2006.55
Filename :
4032243
Link To Document :
بازگشت