DocumentCode :
1350633
Title :
The Small World of File Sharing
Author :
Iamnitchi, Adriana ; Ripeanu, Matei ; Santos-Neto, Elizeu ; Foster, Ian
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
Volume :
22
Issue :
7
fYear :
2011
fDate :
7/1/2011
Firstpage :
1120
Lastpage :
1134
Abstract :
Web caches, content distribution networks, peer-to-peer file-sharing networks, distributed file systems, and data grids all involve a community of users who use shared data. In each case, overall system performance can be improved significantly by first identifying and then exploiting the structure of the community's data access patterns. We propose a novel perspective for analyzing data access workloads that considers the implicit relationships that form among users based on the data they access. We propose a new structure, the interest-sharing graph, that captures common user interests in data, and we justify its utility with studies of four data-sharing systems: a high-energy physics collaboration, the Web, the Kazaa peer-to-peer network, and a BitTorrent file-sharing community. We find small-world patterns in the interest-sharing graphs of all four communities. We investigate analytically and experimentally some of the potential causes that lead to this pattern and conclude that user preferences play a major role. The significance of small-world patterns is twofold: they provide rigorous support for intuition, and they suggest the potential to exploit these naturally emerging patterns. As a proof of concept, we design and evaluate an information dissemination system that exploits the small-world interest-sharing graphs by building an interest-aware network overlay. We show that this approach leads to improved information dissemination performance.
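The interest-sharing graph described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that an access trace is a list of (user, item) pairs and that two users are linked when they have at least `min_shared` items in common; the function names, the threshold parameter, and the trace format are all hypothetical. Small-world graphs are commonly characterized by a high clustering coefficient together with a short average path length, so the sketch computes both.

```python
from collections import defaultdict, deque
from itertools import combinations


def build_interest_graph(accesses, min_shared=1):
    """Link two users when they share at least `min_shared` items (assumed rule)."""
    items_by_user = defaultdict(set)
    for user, item in accesses:
        items_by_user[user].add(item)
    graph = {user: set() for user in items_by_user}
    for u, v in combinations(sorted(items_by_user), 2):
        if len(items_by_user[u] & items_by_user[v]) >= min_shared:
            graph[u].add(v)
            graph[v].add(u)
    return graph


def average_clustering(graph):
    """Mean fraction of each node's neighbor pairs that are themselves linked."""
    total = 0.0
    for nbrs in graph.values():
        k = len(nbrs)
        if k < 2:
            continue  # nodes with fewer than two neighbors contribute 0
        links = sum(1 for a, b in combinations(nbrs, 2) if b in graph[a])
        total += 2.0 * links / (k * (k - 1))
    return total / len(graph) if graph else 0.0


def average_path_length(graph):
    """Mean shortest-path length over all connected ordered pairs (BFS per node)."""
    total, pairs = 0, 0
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            node = queue.popleft()
            for nbr in graph[node]:
                if nbr not in dist:
                    dist[nbr] = dist[node] + 1
                    queue.append(nbr)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0


if __name__ == "__main__":
    # Toy trace invented for illustration; not data from the paper.
    trace = [("a", "f1"), ("b", "f1"), ("c", "f1"), ("c", "f2"), ("d", "f2")]
    g = build_interest_graph(trace)
    print(average_clustering(g), average_path_length(g))
```

The pairwise comparison of users is quadratic in the number of users, which is acceptable for a toy trace but would need an inverted index from items to users on a real workload.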
Keywords :
Internet; information dissemination; information retrieval; peer-to-peer computing; BitTorrent file-sharing community; Kazaa peer-to-peer network; Web caches; community data access patterns; content distribution networks; data grids; distributed file systems; high-energy physics collaboration; information dissemination system; interest-aware network overlay; interest-sharing graph; peer-to-peer file-sharing networks; Collaboration; Communities; Distributed databases; Internet; Peer to peer computing; Physics; Protocols; File sharing; peer-to-peer systems; self-organization; small-world graphs; workload characterization
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
IEEE
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2010.170
Filename :
5601708
Link To Document :