• DocumentCode
    745226
  • Title
    Performance management for cluster-based web services
  • Author
    Pacifici, Giovanni ; Spreitzer, Mike ; Tantawi, Asser N. ; Youssef, Alaa
  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    23
  • Issue
    12
  • fYear
    2005
  • Firstpage
    2333
  • Lastpage
    2343
  • Abstract
    We present an architecture and prototype implementation of a performance management system for cluster-based web services. The system supports multiple classes of web services traffic and allocates server resources dynamically so as to maximize the expected value of a given cluster utility function in the face of fluctuating loads. The cluster utility is a function of the performance delivered to the various classes, which leads to differentiated service. In this paper, we use the average response time as the performance metric. The management system is transparent: it requires no changes in the client code, the server code, or the network interface between them. The system performs three performance management tasks: resource allocation, load balancing, and server overload protection. We use two nested levels of management. The inner level centers on queuing and scheduling of request messages. The outer level is a feedback control loop that periodically adjusts the scheduling weights and server allocations of the inner level. The feedback controller is based on an approximate first-principles model of the system, with parameters derived from continuous monitoring. We focus on SOAP-based web services. We report experimental results that show the dynamic behavior of the system.
  • Keywords
    Internet; client-server systems; computer network management; network servers; quality of service; queueing theory; resource allocation; scheduling; telecommunication traffic; workstation clusters; QoS; SOAP; client-server code; cluster-based Web service; continuous monitoring; feedback control loop; load balancing; network interface; network traffic; performance management system; quality-of-service; queuing; request message; server overload protection; server resource allocation; service differentiation; utility function; Delay; Load management; Measurement; Network interfaces; Prototypes; Resource management; Service oriented architecture; Traffic control; Web services; Clustered computing; performance management; quality-of-service (QoS); utility functions
  • fLanguage
    English
  • Journal_Title
    Selected Areas in Communications, IEEE Journal on
  • Publisher
    IEEE
  • ISSN
    0733-8716
  • Type
    jour
  • DOI
    10.1109/JSAC.2005.857208
  • Filename
    1546102
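
The outer feedback loop described in the abstract (periodically re-deriving scheduling weights so that a cluster utility over per-class average response times is maximized) can be illustrated with a minimal sketch. Nothing here is taken from the paper itself: the M/M/1-style response-time model, the utility shape, the greedy allocation, and every name (`TrafficClass`, `control_step`, `steps`, and so on) are assumptions chosen only to make the idea concrete.

```python
import math
from dataclasses import dataclass


@dataclass
class TrafficClass:
    name: str
    arrival_rate: float   # measured requests/second (hypothetical monitoring input)
    target_time: float    # response-time goal in seconds (hypothetical)
    importance: float     # weight of this class in the cluster utility (hypothetical)


def predicted_response_time(share, capacity, arrival_rate):
    """Assumed M/M/1-style model: the class is served at share * capacity req/s."""
    service_rate = share * capacity
    if service_rate <= arrival_rate:
        return math.inf  # the class would be overloaded at this share
    return 1.0 / (service_rate - arrival_rate)


def class_utility(cls, share, capacity):
    """Assumed utility: importance-scaled relative slack against the target."""
    r = predicted_response_time(share, capacity, cls.arrival_rate)
    return cls.importance * (cls.target_time - r) / cls.target_time


def cluster_utility(classes, shares, capacity):
    """Assumed cluster utility: the sum of the per-class utilities."""
    return sum(class_utility(c, shares[c.name], capacity) for c in classes)


def control_step(classes, capacity, steps=100):
    """One pass of the outer loop: divide capacity into `steps` equal quanta."""
    # Give each class just enough quanta to keep its predicted queue stable.
    quanta = {
        c.name: math.floor(c.arrival_rate * steps / capacity) + 1 for c in classes
    }
    spare = steps - sum(quanta.values())
    if spare < 0:
        raise ValueError("offered load exceeds capacity; overload protection needed")

    def gain(c):
        # Utility gained by granting class c one more quantum.
        k = quanta[c.name]
        return (class_utility(c, (k + 1) / steps, capacity)
                - class_utility(c, k / steps, capacity))

    # Hand out the remaining quanta greedily by marginal utility gain.
    for _ in range(spare):
        best = max(classes, key=gain)
        quanta[best.name] += 1
    return {name: k / steps for name, k in quanta.items()}
```

In a setup like the one the abstract outlines, the returned shares would play the role of the scheduling weights handed to the inner queuing level, and `control_step` would be re-run each control period with freshly monitored arrival rates. The real controller presumably uses a richer model and also decides server allocations, which this sketch omits.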