DocumentCode :
1819538
Title :
An Optimal Capacity Planning Algorithm for Provisioning Cluster-Based Failure-Resilient Composite Services
Author :
Zhang, Chun ; Chang, Rong N. ; Perng, Chang-Shing ; So, Edward ; Tang, Chungqiang ; Tao, Tao
Author_Institution :
T.J. Watson Res. Center, IBM, Yorktown Heights, NY, USA
fYear :
2009
fDate :
21-25 Sept. 2009
Firstpage :
112
Lastpage :
119
Abstract :
Resilience against unexpected server failures is a key desirable function of quality-assured service systems. A good capacity planning decision should cost-effectively allocate spare capacity for exploiting failure resilience mechanisms. In this paper, we propose an optimal capacity planning algorithm for server-cluster based service systems, particularly the ones that provision composite services via several servers. The algorithm takes into account two commonly used failure resilience mechanisms: intra-cluster load-controlling and inter-cluster failover. The goal is to minimize the resource cost while assuring service levels on the end-to-end throughput and response time of provisioned composite services under normal conditions and server failure conditions. We illustrate that the stated goal can be formalized as a capacity planning optimization problem and can be solved mathematically via convex analysis and linear optimization techniques. We also quantitatively demonstrate that the proposed algorithm can find the min-cost capacity planning solution that assures the end-to-end performance of managed composite services for both the non-failure case and the common server failure cases in a three-tier web-based service system with multiple server clusters. To the best of our knowledge, this paper presents the first research effort in optimizing the cost of supporting failure resilience for quality-assured composite services.
Keywords :
convex programming; fault tolerant computing; linear programming; system recovery; capacity planning optimization; convex analysis; failure resilience mechanism; failure-resilient composite service; inter-cluster failover; intra-cluster load-controlling; linear optimization; min-cost capacity planning; optimal capacity planning decision; quality-assured composite service; quality-assured service; resource cost; server-cluster based service; three-tier Web-based service; unexpected server failure; Capacity planning; Clustering algorithms; Cost function; Delay; Design optimization; File servers; Large-scale systems; Resilience; Service oriented architecture; Throughput; Capacity Planning; Failure Resilient Services;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Services Computing, 2009. SCC '09. IEEE International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4244-5183-8
Electronic_ISBN :
978-0-7695-3811-2
Type :
conf
DOI :
10.1109/SCC.2009.81
Filename :
5284009