• DocumentCode
    172567
  • Title

    Estimating clique composition and size distributions from sampled network data

  • Author

    Gjoka, Minas ; Smith, Elena ; Butts, Carter

  • Author_Institution
    Univ. of California, Irvine, Irvine, CA, USA
  • fYear
    2014
  • fDate
    April 27 2014-May 2 2014
  • Firstpage
    837
  • Lastpage
    842
  • Abstract
    Cliques are defined as complete graphs or subgraphs; they are the strongest form of cohesive subgroup, and are of interest in both social science and engineering contexts. In this paper we show how to efficiently estimate the distribution of clique sizes from a probability sample of nodes obtained from a graph (e.g., by independence or link-trace sampling). We introduce two types of unbiased estimators, one of which exploits labeling of sampled nodes neighbors and one of which does not require this information. This is the first work to present statistically principled design-based estimators for clique distributions in arbitrary graphs using sampled network data. We generalize our estimators to cases in which cliques are distinguished not only by size but also by node attributes, allowing us to estimate clique composition by size. Last, we compare our estimators on a variety of real-world graphs and provide suggestions for their use.
  • Keywords
    graph theory; probability; social networking (online); statistical analysis; clique composition estimation; clique size distribution; cohesive subgroup; complete graph; node attributes; probability node sample; sampled network data; sampled nodes neighbor labeling; social network analysis; statistically principled design-based estimators; subgraph; unbiased estimators; Communication networks; Conferences; Estimation; Labeling; Sociology; Statistics; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communications Workshops (INFOCOM WKSHPS), 2014 IEEE Conference on
  • Conference_Location
    Toronto, ON
  • Type

    conf

  • DOI
    10.1109/INFCOMW.2014.6849339
  • Filename
    6849339