Title :
On random walk based graph sampling
Author :
Rong-Hua Li ; Yu, Jeffrey Xu ; Lu Qin ; Rui Mao ; Tan Jin
Author_Institution :
Shenzhen Univ., Shenzhen, China
Abstract :
Random walk based graph sampling has been recognized as a fundamental technique to collect uniform node samples from a large graph. In this paper, we first present a comprehensive analysis of the drawbacks of three widely-used random walk based graph sampling algorithms, called re-weighted random walk (RW) algorithm, Metropolis-Hastings random walk (MH) algorithm and maximum-degree random walk (MD) algorithm. Then, to address the limitations of these algorithms, we propose two general random walk based algorithms, named rejection-controlled Metropolis-Hastings (RCMH) algorithm and generalized maximum-degree random walk (GMD) algorithm. We show that RCMH balances the tradeoff between the limitations of RW and MH, and GMD balances the tradeoff between the drawbacks of RW and MD. To further improve the performance of our algorithms, we integrate the so-called delayed acceptance technique and the non-backtracking random walk technique into RCMH and GMD respectively. We conduct extensive experiments over four real-world datasets, and the results demonstrate the effectiveness of the proposed algorithms.
Keywords :
graph theory; sampling methods; GMD algorithm; MH algorithm; Metropolis-Hastings random walk; RCMH algorithm; RW algorithm; delayed acceptance technique; generalized maximum-degree random walk algorithm; random walk based graph sampling; rejection-controlled Metropolis-Hastings algorithm; reweighted random walk; Algorithm design and analysis; Context; Estimation; Heuristic algorithms; Markov processes; Network topology; Social network services;
Conference_Titel :
Data Engineering (ICDE), 2015 IEEE 31st International Conference on
Conference_Location :
Seoul
DOI :
10.1109/ICDE.2015.7113345