• DocumentCode
    714191
  • Title

    Deduping the Internet: An email case study

  • Author

    Williamson, Carey

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Calgary, Calgary, AB, Canada
  • fYear
    2015
  • fDate
    3-6 May 2015
  • Firstpage
    1345
  • Lastpage
    1350
  • Abstract
    Much of the traffic that traverses the Internet each day is redundant. That is, some or all of the data content has been sent previously. From a technical viewpoint, this represents a waste of resources, in terms of network bandwidth, storage, and energy efficiency. This paper presents an initial feasibility study to assess the potential of data deduplication technologies to reduce Internet traffic. The case study focuses on electronic mail (email), using an email dataset collected over the past 8 years. The results from this longitudinal study suggest that the size, complexity, and redundancy of email messages have all increased over this time duration, as has the complexity of the email delivery infrastructure. The results indicate that bandwidth savings of 30-45% are possible using existing redundant traffic elimination techniques on email messages.
  • Keywords
    Internet; electronic mail; telecommunication traffic; Internet traffic reduction; data deduplication technologies; electronic mail; email case study; email delivery infrastructure; redundant traffic elimination technique; Bandwidth; Complexity theory; Electronic mail; Internet; Portable document format; Redundancy; Routing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering (CCECE), 2015 IEEE 28th Canadian Conference on
  • Conference_Location
    Halifax, NS
  • ISSN
    0840-7789
  • Print_ISBN
    978-1-4799-5827-6
  • Type

    conf

  • DOI
    10.1109/CCECE.2015.7129474
  • Filename
    7129474