• DocumentCode
    3430232
  • Title
    Memory management for scalable Web data servers
  • Author
    Venkataraman, Shivakumar; Livny, Miron; Naughton, Jeffrey F.
  • Author_Institution
    IBM Santa Teresa Labs., San Jose, CA, USA
  • fYear
    1997
  • fDate
    7-11 Apr 1997
  • Firstpage
    510
  • Lastpage
    519
  • Abstract
    Popular Web sites are already experiencing very heavy loads, and these loads will only increase as the number of users accessing them grows. These loads create both CPU and I/O bottlenecks. One promising solution already being employed to eliminate the CPU bottleneck is to replace a single-processor server with a cluster of servers. Our goal in this paper is to develop buffer management algorithms that exploit the aggregate memory capacity of the machines in such a server cluster to attack the I/O bottleneck. The key challenge in designing such buffer management algorithms turns out to be controlling data replication so as to achieve a good balance between intra-cluster network traffic and disk I/O. At one extreme, the straightforward application of client-server memory management techniques to this cluster architecture causes duplication in memory among the servers, which tends to reduce network traffic but increase disk I/O. At the other extreme, eliminating all duplicates tends to increase network traffic while reducing disk I/O. Accordingly, we present a new algorithm, called Hybrid, that dynamically controls the amount of duplication. Through a detailed simulation, we show that, on workloads characteristic of those experienced by Web servers, the Hybrid algorithm correctly trades off intra-cluster network traffic and disk I/O to minimize average response time.
  • Keywords
    Internet; buffer storage; client-server systems; file servers; storage management; telecommunication traffic; CPU bottleneck; Hybrid algorithm; World Wide Web; aggregate memory capacity; average response time minimization; buffer management algorithms; client-server memory management techniques; data replication control; disk I/O bottleneck; intra-cluster network traffic; memory duplication control; scalable Web data servers; server cluster architecture; simulation; workloads; Algorithm design and analysis; Clustering algorithms; File servers; Memory management; Monitoring; Network servers; Operating systems; Switches; TCP/IP; Web server
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering, 1997. Proceedings. 13th International Conference on
  • Conference_Location
    Birmingham
  • ISSN
    1063-6382
  • Print_ISBN
    0-8186-7807-0
  • Type
    conf
  • DOI
    10.1109/ICDE.1997.582018
  • Filename
    582018