DocumentCode
3172153
Title
The Continued Evolution of Web Traffic
Author
Newton, Ben ; Jeffay, Kevin ; Aikat, Jay
Author_Institution
Dept. of Comput. Sci., Univ. of North Carolina at Chapel Hill, Chapel Hill, NC, USA
fYear
2013
fDate
14-16 Aug. 2013
Firstpage
80
Lastpage
89
Abstract
Over the last decade web content has evolved from relatively static pages often delivered by one or two servers, to websites rich with interactive media content served from numerous servers. This content change has affected the associated network traffic. Quantifying and analyzing these changes can lead to updated traffic models and more accurate web traffic simulations for testing new protocols and devices. In this work we analyze the TCP/IP headers in packet traces collected at various times over 13 years on the link that connects the University of North Carolina at Chapel Hill (UNC) to its ISP. We show that while the decade-old methodology for inferring web activity from these packet traces is still viable, it is no longer possible to infer all page boundaries given only the TCP and IP headers. We propose a novel method for segmenting web traffic into Activity Sections, in order to obtain comparable higher level statistics. Using these methods to analyze our data set, we describe trends in the HTTP request and response sizes, and a trend towards longer connection durations. We also show that the number of servers supporting web activity has increased, and present empirical evidence that suggests the number of unused connections has risen, likely due to new speculative TCP preconnect features of popular browsers.
Keywords
Internet; Web sites; data analysis; telecommunication traffic; transport protocols; Chapel Hill; HTTP request; ISP; TCP-IP headers; University of North Carolina; Web activity inference; Web content; Web sites; Web traffic segmentation; Web traffic simulations; activity sections; connection durations; content change; continued Web traffic evolution; data set analysis; device testing; interactive media content; network traffic; packet traces; protocol testing; response sizes; Browsers; IP networks; Market research; Pipeline processing; Web pages; Web servers; TCP/IP headers; activity; empirical model; evolution; web traffic;
fLanguage
English
Publisher
ieee
Conference_Titel
Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2013 IEEE 21st International Symposium on
Conference_Location
San Francisco, CA
ISSN
1526-7539
Type
conf
DOI
10.1109/MASCOTS.2013.16
Filename
6730751
Link To Document