• DocumentCode
    3172153
  • Title

    The Continued Evolution of Web Traffic

  • Author

    Newton, Ben ; Jeffay, Kevin ; Aikat, Jay

  • Author_Institution
    Dept. of Comput. Sci., Univ. of North Carolina at Chapel Hill, Chapel Hill, NC, USA
  • fYear
    2013
  • fDate
    14-16 Aug. 2013
  • Firstpage
    80
  • Lastpage
    89
  • Abstract
    Over the last decade web content has evolved from relatively static pages often delivered by one or two servers, to websites rich with interactive media content served from numerous servers. This content change has affected the associated network traffic. Quantifying and analyzing these changes can lead to updated traffic models and more accurate web traffic simulations for testing new protocols and devices. In this work we analyze the TCP/IP headers in packet traces collected at various times over 13 years on the link that connects the University of North Carolina at Chapel Hill (UNC) to its ISP. We show that while the decade-old methodology for inferring web activity from these packet traces is still viable, it is no longer possible to infer all page boundaries given only the TCP and IP headers. We propose a novel method for segmenting web traffic into Activity Sections, in order to obtain comparable higher level statistics. Using these methods to analyze our data set, we describe trends in the HTTP request and response sizes, and a trend towards longer connection durations. We also show that the number of servers supporting web activity has increased, and present empirical evidence that suggests the number of unused connections has risen, likely due to new speculative TCP preconnect features of popular browsers.
  • Keywords
    Internet; Web sites; data analysis; telecommunication traffic; transport protocols; Chapel Hill; HTTP request; ISP; TCP-IP headers; University of North Carolina; Web activity inference; Web content; Web sites; Web traffic segmentation; Web traffic simulations; activity sections; connection durations; content change; continued Web traffic evolution; data set analysis; device testing; interactive media content; network traffic; packet traces; protocol testing; response sizes; Browsers; IP networks; Market research; Pipeline processing; Web pages; Web servers; TCP/IP headers; activity; empirical model; evolution; web traffic;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2013 IEEE 21st International Symposium on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1526-7539
  • Type

    conf

  • DOI
    10.1109/MASCOTS.2013.16
  • Filename
    6730751