Title :
Circumventing Server Bottlenecks: Indirect Large-Scale P2P Data Collection
Author :
Niu, Di ; Li, Baochun
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON
Abstract :
In most large-scale peer-to-peer (P2P) applications, it is necessary to collect vital statistics data - sometimes referred to as logs - from up to millions of peers. Traditional solutions involve sending large volumes of such data to centralized logging servers, which are not scalable. In addition, they may not be able to retrieve statistics data from departed peers in dynamic peer-to-peer systems. In this paper, we solve this dilemma through an indirect collection mechanism that distributes data using random network coding across the network, from which servers proactively pull such statistics. By buffering data in a decentralized fashion with only a small portion of peer resources, we show that our new mechanism provides a "buffering" zone and a "smoothing" factor to collect large volumes of statistics, with appropriate resilience to peer dynamics and scalability to a large peer population.
Keywords :
peer-to-peer computing; statistical analysis; centralized logging servers; indirect large-scale P2P data collection; large-scale peer-to-peer applications; random network coding; server bottlenecks; Information retrieval; Large-scale systems; Network coding; Network servers; Peer to peer computing; Resilience; Scalability; Smoothing methods; Statistical distributions; Statistics;
Conference_Titel :
Distributed Computing Systems, 2008. ICDCS '08. The 28th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-0-7695-3172-4
Electronic_ISBN :
1063-6927
DOI :
10.1109/ICDCS.2008.74