DocumentCode
745226
Title
Performance management for cluster-based web services
Author
Pacifici, Giovanni ; Spreitzer, Mike ; Tantawi, Asser N. ; Youssef, Alaa
Author_Institution
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
23
Issue
12
fYear
2005
Firstpage
2333
Lastpage
2343
Abstract
We present an architecture and prototype implementation of a performance management system for cluster-based web services. The system supports multiple classes of web services traffic and allocates server resources dynamically so to maximize the expected value of a given cluster utility function in the face of fluctuating loads. The cluster utility is a function of the performance delivered to the various classes, and this leads to differentiated service. In this paper, we will use the average response time as the performance metric. The management system is transparent: it requires no changes in the client code, the server code, or the network interface between them. The system performs three performance management tasks: resource allocation, load balancing, and server overload protection. We use two nested levels of management. The inner level centers on queuing and scheduling of request messages. The outer level is a feedback control loop that periodically adjusts the scheduling weights and server allocations of the inner level. The feedback controller is based on an approximate first-principles model of the system, with parameters derived from continuous monitoring. We focus on SOAP-based web services. We report experimental results that show the dynamic behavior of the system.
Keywords
Internet; client-server systems; computer network management; network servers; quality of service; queueing theory; resource allocation; scheduling; telecommunication traffic; workstation clusters; QoS; SOAP; client-server code; cluster-based Web service; continuous monitoring; feedback control loop; load balancing; network interface; network traffic; performance management system; quality-of-service; queuing; request message; scheduling; server overload protection; server resource allocation; service differentiation; utility function; Delay; Load management; Measurement; Network interfaces; Network servers; Prototypes; Resource management; Service oriented architecture; Traffic control; Web services; Clustered computing; Web services; performance management; quality-of-service (QoS); resource allocation; service differentiation; utility functions;
fLanguage
English
Journal_Title
Selected Areas in Communications, IEEE Journal on
Publisher
ieee
ISSN
0733-8716
Type
jour
DOI
10.1109/JSAC.2005.857208
Filename
1546102
Link To Document