• DocumentCode
    1331032
  • Title

    Multigraph Sampling of Online Social Networks

  • Author

    Gjoka, Minas ; Butts, C.T. ; Kurant, Maciej ; Markopoulou, Athina

  • Author_Institution
    California Inst. for Telecommun. & Inf. Technol. (CalIT2), Univ. of California, Irvine, CA, USA
  • Volume
    29
  • Issue
    9
  • fYear
    2011
  • fDate
    10/1/2011 12:00:00 AM
  • Firstpage
    1893
  • Lastpage
    1905
  • Abstract
    State-of-the-art techniques for probability sampling of users of online social networks (OSNs) are based on random walks on a single social relation (typically friendship). While powerful, these methods rely on the social graph being fully connected. Furthermore, the mixing time of the sampling process strongly depends on the characteristics of this graph. In this paper, we observe that there often exist other relations between OSN users, such as membership in the same group or participation in the same event. We propose to exploit the graphs these relations induce, by performing a random walk on their union multigraph. We design a computationally efficient way to perform multigraph sampling by randomly selecting the graph on which to walk at each iteration. We demonstrate the benefits of our approach through (i) simulation in synthetic graphs, and (ii) measurements of Last.fm- an Internet website for music with social networking features. More specifically, we show that multigraph sampling can obtain a representative sample and faster convergence, even when the individual graphs fail, i.e., are disconnected or highly clustered.
  • Keywords
    graph theory; sampling methods; social networking (online); multigraph sampling; online social networks; probability sampling; random walks; sampling process; social graph; social networking features; social relation; synthetic graphs; union multigraph; Communities; Convergence; Crawlers; Erbium; Internet; Markov processes; Social network services; Graph sampling; Last.fm; Multigraph; Random walks; Sampling methods; Social network services;
  • fLanguage
    English
  • Journal_Title
    Selected Areas in Communications, IEEE Journal on
  • Publisher
    ieee
  • ISSN
    0733-8716
  • Type

    jour

  • DOI
    10.1109/JSAC.2011.111012
  • Filename
    6027869