Title :
The Continued Evolution of Web Traffic
Author :
Newton, Ben ; Jeffay, Kevin ; Aikat, Jay
Author_Institution :
Dept. of Comput. Sci., Univ. of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Abstract :
Over the last decade web content has evolved from relatively static pages often delivered by one or two servers, to websites rich with interactive media content served from numerous servers. This content change has affected the associated network traffic. Quantifying and analyzing these changes can lead to updated traffic models and more accurate web traffic simulations for testing new protocols and devices. In this work we analyze the TCP/IP headers in packet traces collected at various times over 13 years on the link that connects the University of North Carolina at Chapel Hill (UNC) to its ISP. We show that while the decade-old methodology for inferring web activity from these packet traces is still viable, it is no longer possible to infer all page boundaries given only the TCP and IP headers. We propose a novel method for segmenting web traffic into Activity Sections, in order to obtain comparable higher level statistics. Using these methods to analyze our data set, we describe trends in the HTTP request and response sizes, and a trend towards longer connection durations. We also show that the number of servers supporting web activity has increased, and present empirical evidence that suggests the number of unused connections has risen, likely due to new speculative TCP preconnect features of popular browsers.
Keywords :
Internet; Web sites; data analysis; telecommunication traffic; transport protocols; Chapel Hill; HTTP request; ISP; TCP-IP headers; University of North Carolina; Web activity inference; Web content; Web sites; Web traffic segmentation; Web traffic simulations; activity sections; connection durations; content change; continued Web traffic evolution; data set analysis; device testing; interactive media content; network traffic; packet traces; protocol testing; response sizes; Browsers; IP networks; Market research; Pipeline processing; Web pages; Web servers; TCP/IP headers; activity; empirical model; evolution; web traffic;
Conference_Titel :
Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2013 IEEE 21st International Symposium on
Conference_Location :
San Francisco, CA
DOI :
10.1109/MASCOTS.2013.16