• DocumentCode
    3124108
  • Title

    LinkBoost: A Novel Cost-Sensitive Boosting Framework for Community-Level Network Link Prediction

  • Author

    Comar, Prakash Mandayam ; Tan, Pang-Ning ; Jain, Anil K.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI, USA
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    131
  • Lastpage
    140
  • Abstract
    Link prediction is a challenging task due to the inherent skew ness of network data. Typical link prediction methods can be categorized as either local or global. Local methods consider the link structure in the immediate neighborhood of a node pair to determine the presence or absence of a link, whereas global methods utilize information from the whole network. This paper presents a community (cluster) level link prediction method without the need to explicitly identify the communities in a network. Specifically, a variable-cost loss function is defined to address the data skew ness problem. We provide theoretical proof that shows the equivalence between maximizing the well-known modularity measure used in community detection and minimizing a special case of the proposed loss function. As a result, any link prediction method designed to optimize the loss function would result in more links being predicted within a community than between communities. We design a boosting algorithm to minimize the loss function and present an approach to scale-up the algorithm by decomposing the network into smaller partitions and aggregating the weak learners constructed from each partition. Experimental results show that our proposed Link Boost algorithm consistently performs as good as or better than many existing methods when evaluated on 4 real-world network datasets.
  • Keywords
    data handling; network theory (graphs); LinkBoost; boosting algorithm; community level network link prediction; novel cost sensitive boosting framework; Algorithm design and analysis; Boosting; Communities; Loss measurement; Partitioning algorithms; Prediction algorithms; Predictive models; Boosting; Community Detection; Link Prediction; Modularity; Social Networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver,BC
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4577-2075-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2011.93
  • Filename
    6137217