• DocumentCode
    3570936
  • Title

    A hidden treasure? Evaluating and extending latent methods for link-based classification

  • Author

    Fleming, Aaron ; McDowell, Luke K. ; Markel, Zane

  • Author_Institution
    Dept. Comput. Sci., U.S. Naval Acad., Annapolis, MD, USA
  • fYear
    2014
  • Firstpage
    669
  • Lastpage
    676
  • Abstract
    Many information tasks involve objects that are explicitly or implicitly connected in a network, such as webpages connected by hyperlinks or people linked by "friendships" in a social network. Research on link-based classification (LBC) has studied how to leverage these connections to improve classification accuracy. This research broadly falls into two groups. First, there are methods that use the original attributes and/or links of the network, via a link-aware supervised classifier or via a non-learning method based on label propagation or random walks. Second, there are recent methods that first compute a set of latent features or links that summarize the network, then use a (hopefully simpler) supervised classifier or label propagation method. Some work has claimed that the latent methods can improve accuracy, but has not adequately compared with the best non-latent methods. In response, this paper provides the first substantial comparison between these two groups. We find that certain non-latent methods typically provide the best overall accuracy, but that latent methods can be competitive when a network is densely-labeled or when the attributes are not very informative. Moreover, we introduce two novel combinations of these methods that in some cases substantially increase accuracy.
  • Keywords
    learning (artificial intelligence); network theory (graphs); pattern classification; LBC; densely-labeled network; label propagation method; latent methods; link-aware supervised classifier; link-based classification; nonlearning method; random walks; social network; supervised classifier; Accuracy; Logistics; Niobium; Prediction algorithms; Proteins; Support vector machines; Taxonomy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
  • Type

    conf

  • DOI
    10.1109/IRI.2014.7051954
  • Filename
    7051954