DocumentCode :
2323150
Title :
Unbiased Sampling of Bipartite Graph
Author :
Wang, Jing ; Guo, Yuchun
Author_Institution :
Sch. of Electron. & Inf. Eng., Beijing Jiaotong Univ., Beijing, China
fYear :
2011
fDate :
10-12 Oct. 2011
Firstpage :
357
Lastpage :
360
Abstract :
Increasing size of online social networks (OSNs) has given rise to sampling method studies that provide a relatively small but representative sample of large-scale OSNs so that the measurement and analysis burden can be affordable. So far, a number of sampling methods already exist that crawl social graphs. Most of them are suitable for one-mode graph where there is only one type of nodes. Literatures show that Metropolis-Hastings Random Walk (MHRW) produces unbiased samples with better performance than other sampling methods. But there are more and more online social networking sites with two types of nodes, such as Taobao and eBay. Representing these two-mode networks as bipartite graphs, we study the sampling methods for bipartite graphs in this paper. Our contributions include analyze the effectiveness of extending MHRW algorithm to bipartite graphs and making a modification in sampling procedure to improve the stability. Finally, we compare our MHRW sampling algorithm with Random Walk (RW) over the generated bipartite graphs as well as real two-mode network graphs. Simulations show that MHRW outperforms RW over bipartite graphs.
Keywords :
graph theory; random processes; sampling methods; social networking (online); MHRW algorithm; MHRW sampling algorithm; Taobao; bipartite graph; crawl social graphs; eBay; large-scale OSN; metropolis-Hastings random walk; one-mode graph; online social networking sites; online social networks; real two-mode network graphs; sampling method study; sampling procedure; two-mode networks; unbiased samples; unbiased sampling; Algorithm design and analysis; Bipartite graph; Educational institutions; Internet; Motion pictures; Sampling methods; Social network services; bipartite graph; crawling; online social networks; random walks; sampling ratio;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1827-4
Type :
conf
DOI :
10.1109/CyberC.2011.63
Filename :
6079455
Link To Document :
بازگشت