DocumentCode :
2864677
Title :
Characterizing Twitter with Respondent-Driven Sampling
Author :
Salehi, Mostafa ; Rabiee, Hamid R. ; Nabavi, Nasim ; Pooya, Shayan
Author_Institution :
Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran, Iran
fYear :
2011
fDate :
12-14 Dec. 2011
Firstpage :
1211
Lastpage :
1217
Abstract :
Twitter as one of the most important microblogging online social networks has attracted more than 200 million users in recent years. Although there have been several attempts on characterizing the Twitter by using incomplete sampled data, they have not been very successful to estimate the characteristics of the whole network. In this paper, we characterize Twitter by sampling from its social graph and user behaviors through a random walk based sampling technique called Respondent-Driven Sampling (RDS). To the best of our knowledge, for the first time RDS method and its estimator are used in order to obtain uniform unbiased estimation of several key structural and behavioral properties of Twitter. We compared the performance of the proposed method with other sampling methods such as Metropolis-Hasting Random Walk (MHRW) and sampling from active users (Timeline) against the uniform sampling (UNI). In order to gather the required data, we have implemented four independent crawlers. Our experimental results indicate that the RDS method exhibits lower estimation errors to the sample in- and out-degree distribution compared to MHRW and Timeline. We also show that RDS is more suitable to sample the followers vs. followings ratio, and the correlation between followers/followings vs. tweets.
Keywords :
graph theory; random processes; sampling methods; social networking (online); MHRW; Metropolis-Hasting random walk; RDS method; Twitter; UNI; estimation error; microblogging online social network; random walk based sampling technique; respondent-driven sampling; social graph; uniform sampling; user behavior; Correlation; Facebook; Mathematical model; Peer to peer computing; Sampling methods; Twitter; Crawling; MHRW; Online Social Network; Public Timeline; RDS; Sampling; Twitter; Uniform;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable, Autonomic and Secure Computing (DASC), 2011 IEEE Ninth International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4673-0006-3
Type :
conf
DOI :
10.1109/DASC.2011.196
Filename :
6118852
Link To Document :
بازگشت