• DocumentCode
    1940010
  • Title

    A first look at inter-data center traffic characteristics via Yahoo! datasets

  • Author

    Chen, Yingying ; Jain, Sourabh ; Adhikari, Vijay Kumar ; Zhang, Zhi-Li ; Xu, Kuai

  • Author_Institution
    Univ. of Minnesota-Twin Cities, Minneapolis, MN, USA
  • fYear
    2011
  • fDate
    10-15 April 2011
  • Firstpage
    1620
  • Lastpage
    1628
  • Abstract
    Effectively managing multiple data centers and their traffic dynamics pose many challenges to their operators, as little is known about the characteristics of inter-data center (D2D) traffic. In this paper we present a first study of D2D traffic characteristics using the anonymized NetFlow datasets collected at the border routers of five major Yahoo! data centers. Our contributions are mainly two-fold: i) we develop novel heuristics to infer the Yahoo! IP addresses and localize their locations from the anonymized NetFlow datasets, and ii) we study and analyze both D2D and client traffic characteristics and the correlations between these two types of traffic. Our study reveals that Yahoo! uses a hierarchical way of deploying data centers, with several satellite data centers distributed in other countries and backbone data centers distributed in US locations. For Yahoo! US data centers, we separate the client-triggered D2D traffic and background D2D traffic from the aggregate D2D traffic using port based correlation, and study their respective characteristics. Our findings shed light on the interplay of multiple data centers and their traffic dynamics within a large content provider, and provide insights to data center designers and operators as well as researchers.
  • Keywords
    Internet; computer centres; telecommunication traffic; IP addresses; Yahoo! datasets; anonymized NetFlow datasets; client-triggered D2D traffic; data center management; interdata center traffic; Correlation; IP networks; Integrated circuits; Satellites; Anonymization; Content provider; Inter-data center; NetFlow;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    INFOCOM, 2011 Proceedings IEEE
  • Conference_Location
    Shanghai
  • ISSN
    0743-166X
  • Print_ISBN
    978-1-4244-9919-9
  • Type

    conf

  • DOI
    10.1109/INFCOM.2011.5934955
  • Filename
    5934955