DocumentCode
3570936
Title
A hidden treasure? Evaluating and extending latent methods for link-based classification
Author
Fleming, Aaron ; McDowell, Luke K. ; Markel, Zane
Author_Institution
Dept. Comput. Sci., U.S. Naval Acad., Annapolis, MD, USA
fYear
2014
Firstpage
669
Lastpage
676
Abstract
Many information tasks involve objects that are explicitly or implicitly connected in a network, such as webpages connected by hyperlinks or people linked by "friendships" in a social network. Research on link-based classification (LBC) has studied how to leverage these connections to improve classification accuracy. This research broadly falls into two groups. First, there are methods that use the original attributes and/or links of the network, via a link-aware supervised classifier or via a non-learning method based on label propagation or random walks. Second, there are recent methods that first compute a set of latent features or links that summarize the network, then use a (hopefully simpler) supervised classifier or label propagation method. Some work has claimed that the latent methods can improve accuracy, but has not adequately compared with the best non-latent methods. In response, this paper provides the first substantial comparison between these two groups. We find that certain non-latent methods typically provide the best overall accuracy, but that latent methods can be competitive when a network is densely-labeled or when the attributes are not very informative. Moreover, we introduce two novel combinations of these methods that in some cases substantially increase accuracy.
Keywords
learning (artificial intelligence); network theory (graphs); pattern classification; LBC; densely-labeled network; label propagation method; latent methods; link-aware supervised classifier; link-based classification; nonlearning method; random walks; social network; supervised classifier; Accuracy; Logistics; Niobium; Prediction algorithms; Proteins; Support vector machines; Taxonomy;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
Type
conf
DOI
10.1109/IRI.2014.7051954
Filename
7051954
Link To Document