DocumentCode
714191
Title
Deduping the Internet: An email case study
Author
Williamson, Carey
Author_Institution
Dept. of Comput. Sci., Univ. of Calgary, Calgary, AB, Canada
fYear
2015
fDate
3-6 May 2015
Firstpage
1345
Lastpage
1350
Abstract
Much of the traffic that traverses the Internet each day is redundant: some or all of the data content has been sent previously. From a technical viewpoint, this represents a waste of resources in terms of network bandwidth, storage, and energy. This paper presents an initial feasibility study to assess the potential of data deduplication technologies to reduce Internet traffic. The case study focuses on electronic mail (email), using an email dataset collected over the past 8 years. The results from this longitudinal study suggest that the size, complexity, and redundancy of email messages have all increased over this period, as has the complexity of the email delivery infrastructure. The results indicate that bandwidth savings of 30-45% are possible using existing redundant traffic elimination techniques on email messages.
Keywords
Internet; electronic mail; telecommunication traffic; Internet traffic reduction; data deduplication technologies; email case study; email delivery infrastructure; redundant traffic elimination technique; Bandwidth; Complexity theory; Portable document format; Redundancy; Routing
fLanguage
English
Publisher
IEEE
Conference_Titel
Electrical and Computer Engineering (CCECE), 2015 IEEE 28th Canadian Conference on
Conference_Location
Halifax, NS
ISSN
0840-7789
Print_ISBN
978-1-4799-5827-6
Type
conf
DOI
10.1109/CCECE.2015.7129474
Filename
7129474