DocumentCode
880488
Title
On Unbiased Sampling for Unstructured Peer-to-Peer Networks
Author
Stutzbach, Daniel ; Rejaie, Reza ; Duffield, Nick ; Sen, Subhabrata ; Willinger, Walter
Author_Institution
Stutzbach Enterprises, Dallas, TX
Volume
17
Issue
2
fYear
2009
fDate
4/1/2009 12:00:00 AM
Firstpage
377
Lastpage
390
Abstract
This paper presents a detailed examination of how the dynamic and heterogeneous nature of real-world peer-to-peer systems can introduce bias into the selection of representative samples of peer properties (e.g., degree, link bandwidth, number of files shared). We propose the metropolized random walk with backtracking (MRWB) as a viable and promising technique for collecting nearly unbiased samples and conduct an extensive simulation study to demonstrate that our technique works well for a wide variety of commonly-encountered peer-to-peer network conditions. We have implemented the MRWB algorithm for selecting peer addresses uniformly at random into a tool called ion-sampler. Using the Gnutella network, we empirically show that ion-sampler yields more accurate samples than tools that rely on commonly-used sampling techniques and results in dramatic improvements in efficiency and scalability compared to performing a full crawl.
Keywords
backtracking; peer-to-peer computing; random processes; sampling methods; telecommunication network topology; Gnutella network; backtracking; ion-sampler; metropolized random walk; network topology; unbiased sampling; unstructured peer-to-peer network; Peer-to-peer; sampling;
fLanguage
English
Journal_Title
Networking, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1063-6692
Type
jour
DOI
10.1109/TNET.2008.2001730
Filename
4637905
Link To Document