• DocumentCode
    1599845
  • Title
    A measurement-based study of big-data movement
  • Author
    Addanki, Ranjana ; Maji, Sourav ; Veeraraghavan, Malathi ; Tracy, Chris
  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Virginia, Charlottesville, VA, USA
  • fYear
    2015
  • Firstpage
    445
  • Lastpage
    449
  • Abstract
    Parallel TCP connections are used for large scientific dataset transfers to increase throughput. Therefore, to accurately characterize big-data movement, it is important to reconstruct parallel flowsets from traffic measurements. In this work, we start with NetFlow records collected in an operational research-and-education network across which large scientific datasets are moved routinely, reconstruct individual elephant flows from the NetFlow records, and assemble parallel flowsets from elephant flows. Our findings are as follows. The top 1% of flowset sizes were in the hundreds of GBs to low TBs range, 95% of flowsets had rates less than 2.5 Gbps, and 99% of flowsets had durations shorter than 4 hours. The median flowset rate increases, and the rate variance decreases, as the number of component flows per flowset increases. Such findings are useful for network planning, traffic engineering, and improving user performance, since large dataset transfers are among the most demanding of network applications.
  • Keywords
    data handling; parallel machines; transport protocols; NetFlow records; assemble parallel flowsets; big data movement; elephant flows; median flowset rate; network applications; network planning; operational research-and-education network; parallel TCP connections; parallel flowsets; scientific dataset; traffic engineering; traffic measurements; Europe; Frequency selective surfaces; IP networks; Indexes; Internet; Planning; Throughput; Measurements; data movement
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Networks and Communications (EuCNC), 2015 European Conference on
  • Conference_Location
    Paris
  • Type
    conf
  • DOI
    10.1109/EuCNC.2015.7194115
  • Filename
    7194115