• DocumentCode
    2394465
  • Title

    Understanding Graph Sampling Algorithms for Social Network Analysis

  • Author

    Wang, Tianyi ; Chen, Yang ; Zhang, Zengbin ; Xu, Tianyin ; Jin, Long ; Hui, Pan ; Deng, Beixing ; Li, Xing

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2011
  • fDate
    20-24 June 2011
  • Firstpage
    123
  • Lastpage
    128
  • Abstract
    Being able to keep the graph scale small while capturing the properties of the original social graph, graph sampling provides an efficient, yet inexpensive solution for social network analysis. The challenge is how to create a small, but representative sample out of the massive social graph with millions or even billions of nodes. Several sampling algorithms have been proposed in previous studies, but there lacks fair evaluation and comparison among them. In this paper, we analyze the state-of art graph sampling algorithms and evaluate their performance on some widely recognized graph properties on directed graphs using large-scale social network datasets. We evaluate not only the commonly used node degree distribution, but also clustering coefficient, which quantifies how well connected are the neighbors of a node in a graph. Through the comparison we have found that none of the algorithms is able to obtain satisfied sampling results in both of these properties, and the performance of each algorithm differs much in different kinds of datasets.
  • Keywords
    graph theory; sampling methods; social networking (online); graph sampling algorithms; graph scale; social graph; social network analysis; Algorithm design and analysis; Clustering algorithms; Electronic publishing; Encyclopedias; Internet; Proposals; Social network services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Distributed Computing Systems Workshops (ICDCSW), 2011 31st International Conference on
  • Conference_Location
    Minneapolis, MN
  • ISSN
    1545-0678
  • Print_ISBN
    978-1-4577-0384-3
  • Electronic_ISBN
    1545-0678
  • Type

    conf

  • DOI
    10.1109/ICDCSW.2011.34
  • Filename
    5961350