DocumentCode
1181609
Title
An on-line, business-oriented optimization of performance and availability for utility computing
Author
Hellerstein, Joseph L. ; Katircioglu, Kaan ; Surendra, Maheswaran
Author_Institution
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
23
Issue
10
fYear
2005
Firstpage
2013
Lastpage
2021
Abstract
Utility computing provides a pay-as-you-go approach to information systems in which application providers (e.g., web sites) can better manage their costs by adding capacity in response to increased demands and shedding capacity when it is no longer needed. This paper addresses application providers who use clusters of servers. Our work develops a framework to determine the number of servers that minimizes the sum of quality-of-service (QoS) costs resulting from service level penalties and server holding costs for the server cluster. The server characteristics considered are service rate, failure rates, repair rates, and costs. The contributions of this paper are: 1) a model for the performance and availability of an e-Commerce system that is consistent with data from a multisystem testbed with an e-Commerce workload; 2) a business-oriented cost model for resource allocation for application providers; 3) a closed form approximation for the optimal allocation of servers for an application provider based on the performance model in 1) and the cost model in 2); and 4) a simple criteria for utility owners and server manufacturers to make tradeoffs between server characteristics.
Keywords
business communication; costing; electronic commerce; optimisation; quality of service; resource allocation; utility programs; QoS; SLA; application provider; business-oriented cost model; closed form approximation; e-commerce system; information system; multisystem testbed; on-line business-oriented optimization; optimal server allocation; quality-of-service; resource allocation; server cluster; server holding cost; service level agreement; utility computing; Application software; Availability; Cost function; Delay; Hardware; Licenses; Management information systems; Quality of service; Resource management; System testing; Optimal server allocation; performability; service level agreement (SLA);
fLanguage
English
Journal_Title
Selected Areas in Communications, IEEE Journal on
Publisher
ieee
ISSN
0733-8716
Type
jour
DOI
10.1109/JSAC.2005.854125
Filename
1514530
Link To Document