Title :
A hidden semi-Markov model for web workload self-similarity
Author :
Yu, Shun-Zheng ; Liu, Zhen ; Squillante, Mark S. ; Xia, Cathy ; Li Zhang
Author_Institution :
Res. Div., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Hidden semi-Markov models (HSMMs) have been well studied and successfully applied to many engineering and scientific problems. The advantage of using a HSMM is its efficient forward-backward algorithms for estimating model parameters to account for an observed sequence. In this paper, we propose a HSMM for modeling Web workloads. We show that this model asymptotically characterizes second order self-similar workloads when some duration distributions of the hidden states are heavy-tailed. A recursive formula is developed for estimating the Hurst parameter of self-similarity. We validate our model and estimation methods with respect to two sets of empirical data (requests per second) collected from two different Web servers. We then use this model to generate self-similar workloads that exhibit the same statistical properties. These measurements show that we can use as few as 4 states together with a simple Poisson process and heavy-tailed Pareto holding time distributions to accurately model the Web workloads considered in this study
Keywords :
Internet; hidden Markov models; parameter estimation; resource allocation; telecommunication traffic; Hurst parameter; Poisson process; Web workloads; hidden semiMarkov models; parameter estimation; self-similar workloads; Aggregates; Communication system traffic control; Local area networks; Parameter estimation; Queueing analysis; Recursive estimation; Telecommunication traffic; Time measurement; Traffic control; Web server;
Conference_Titel :
Performance, Computing, and Communications Conference, 2002. 21st IEEE International
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-7371-5
DOI :
10.1109/IPCCC.2002.995137