DocumentCode :
599925
Title :
An Efficient Fault-Tolerant Algorithm for Distributed Cloud Services
Author :
Al-Jaroodi, Jameela ; Mohamed, N. ; Nuaimi, K.A.
Author_Institution :
Middleware Technol. Lab., UAEU, Al Ain, United Arab Emirates
fYear :
2012
fDate :
3-4 Dec. 2012
Firstpage :
1
Lastpage :
8
Abstract :
Several approaches for fault-tolerance in distributed systems were introduced; however, they require prior knowledge of the environment´s operating conditions and/or constant monitoring of these conditions at run time. That allows the applications to adjust the load and redistribute the tasks when failures occur. These techniques work well when there is no high communication delay. Yet, this is not true in the Cloud, where data and computation servers are connected over the Internet and distributed across large geographic areas. Thus they usually exhibit high and dynamic communication delays that make discovering and recovering from failures take a long time. This paper proposes a delay-tolerant fault-tolerance algorithm that effectively reduces execution time and adapts for failures while minimizing the fault discovery and recovery overhead in the Cloud. Distributed tasks that can use this algorithm include downloading data from replicated servers and executing parallel applications on multiple independent distributed servers in the Cloud. The experimental results show the efficiency of the algorithm and its fault tolerance feature.
Keywords :
client-server systems; cloud computing; software fault tolerance; Internet; computation servers; condition monitoring; data downloading; data servers; delay-tolerant fault tolerance algorithm; distributed cloud services; distributed servers; dynamic communication delays; execution time; fault discovery; fault recovery; replicated servers; Delay; Fault tolerance; Fault tolerant systems; Heuristic algorithms; Load management; Monitoring; Servers; Cloud computing; fault-tolerance; heterogeneous distributed systems; load balancing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network Cloud Computing and Applications (NCCA), 2012 Second Symposium on
Conference_Location :
London
Print_ISBN :
978-1-4673-5581-0
Type :
conf
DOI :
10.1109/NCCA.2012.21
Filename :
6472452
Link To Document :
بازگشت