DocumentCode :
3387326
Title :
Investigating Distribution of Data of HTTP Traffic: An Empirical Study
Author :
Chehadeh, Y.C. ; Hatahet, A.Z. ; Agamy, A.E. ; Bamakhrama, M.A. ; Banawan, S.A.
Author_Institution :
Modelware Inc., Red Bank, NJ
fYear :
2006
fDate :
Nov. 2006
Firstpage :
1
Lastpage :
5
Abstract :
Internet traffic today is dominated by that of the hypertext transfer protocol (HTTP). Understanding the statistical characteristics of the data transferred via HTTP helps better model traffic patterns. In this work, we conduct an empirical study by employing an experiment that accesses roughly 34,000 of the most popular Web sites on the Internet today and crawls their Web pages. We collect metadata information on the retrieved roughly two million objects. We determine statistics and distributions based on object sizes, occurrence of specific types, and sizes of specific types. The data of the distributions produced can be used as a template model for Web-traffic modeling in future research. We further note an intriguing result that 5.7% of HTTP traffic from Web servers to clients is due to sending spacer objects (image files representing a 1times1 white-space pixel) or to stale links referencing non-existing files. Such squander in bandwidth is not due to overhead and can be minimized by simple additions to the HTML standard and by automating the process of removing stale links
Keywords :
Internet; Web sites; hypermedia; meta data; transport protocols; HTTP traffic pattern; Internet traffic; Web page; Web server; Web sites; Web traffic modeling; data distribution; hypertext transfer protocol; metadata information; spacer object; stale link; statistical data transfer characteristics; template model; Access protocols; Bandwidth; Information retrieval; Internet; Pixel; Statistical distributions; Traffic control; Web pages; Web server; White spaces;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Innovations in Information Technology, 2006
Conference_Location :
Dubai
Print_ISBN :
1-4244-0674-9
Electronic_ISBN :
1-4244-0674-9
Type :
conf
DOI :
10.1109/INNOVATIONS.2006.301928
Filename :
4085443
Link To Document :
بازگشت