DocumentCode :
745226
Title :
Performance management for cluster-based web services
Author :
Pacifici, Giovanni ; Spreitzer, Mike ; Tantawi, Asser N. ; Youssef, Alaa
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
23
Issue :
12
fYear :
2005
Firstpage :
2333
Lastpage :
2343
Abstract :
We present an architecture and prototype implementation of a performance management system for cluster-based web services. The system supports multiple classes of web services traffic and allocates server resources dynamically so to maximize the expected value of a given cluster utility function in the face of fluctuating loads. The cluster utility is a function of the performance delivered to the various classes, and this leads to differentiated service. In this paper, we will use the average response time as the performance metric. The management system is transparent: it requires no changes in the client code, the server code, or the network interface between them. The system performs three performance management tasks: resource allocation, load balancing, and server overload protection. We use two nested levels of management. The inner level centers on queuing and scheduling of request messages. The outer level is a feedback control loop that periodically adjusts the scheduling weights and server allocations of the inner level. The feedback controller is based on an approximate first-principles model of the system, with parameters derived from continuous monitoring. We focus on SOAP-based web services. We report experimental results that show the dynamic behavior of the system.
Keywords :
Internet; client-server systems; computer network management; network servers; quality of service; queueing theory; resource allocation; scheduling; telecommunication traffic; workstation clusters; QoS; SOAP; client-server code; cluster-based Web service; continuous monitoring; feedback control loop; load balancing; network interface; network traffic; performance management system; quality-of-service; queuing; request message; scheduling; server overload protection; server resource allocation; service differentiation; utility function; Delay; Load management; Measurement; Network interfaces; Network servers; Prototypes; Resource management; Service oriented architecture; Traffic control; Web services; Clustered computing; Web services; performance management; quality-of-service (QoS); resource allocation; service differentiation; utility functions;
fLanguage :
English
Journal_Title :
Selected Areas in Communications, IEEE Journal on
Publisher :
ieee
ISSN :
0733-8716
Type :
jour
DOI :
10.1109/JSAC.2005.857208
Filename :
1546102
Link To Document :
بازگشت