DocumentCode
3621990
Title
A Contribution Towards Solving the Web Workload Puzzle
Author
K. Goseva-Popstojanova; Fengbin Li; Xuan Wang; A. Sangle
Author_Institution
West Virginia University, Morgantown, WV
fYear
2006
fDate
2006
Firstpage
505
Lastpage
516
Abstract
The World Wide Web, the biggest distributed system ever built, experiences tremendous growth and change in Web sites, users, and technology. A realistic and accurate characterization of Web workload is the first, fundamental step in areas such as performance analysis and prediction, capacity planning, and admission control. Compared to previous work, this paper presents a more detailed and rigorous statistical analysis of both request-level and session-level characteristics of Web workload, based on empirical data extracted from actual logs of four Web servers. Our analysis focuses on phenomena such as self-similarity, long-range dependence, and heavy-tailed distributions. Identifying these phenomena in real data is challenging because the existing methods may perform erratically in practice and produce misleading results. We provide a more accurate analysis of long-range dependence of the request and session arrival processes by removing the trend and periodicity. In addition to the session arrival process (i.e., inter-session characteristics), we study several intra-session characteristics, using several different methods to test for heavy-tailed behavior and cross-validate the results. Finally, we point out specific problems associated with the methods used for establishing long-range dependence and heavy-tailed behavior of Web workloads. We believe that the comprehensive model presented in this paper is a step towards solving the Web workload puzzle.
Keywords
"Telecommunication traffic","Traffic control","Web sites","Performance analysis","Admission control","Web server","Computer science","Capacity planning","Statistical analysis","Data mining"
Publisher
IEEE
Conference_Titel
Dependable Systems and Networks, 2006. DSN 2006. International Conference on
Print_ISBN
0-7695-2607-1
Type
conf
DOI
10.1109/DSN.2006.2
Filename
1633539