DocumentCode
679556
Title
Sampling Heterogeneous Networks
Author
Cheng-Lun Yang ; Perng-Hwa Kung ; Cheng-Te Li ; Chun-An Chen ; Shou-De Lin
Author_Institution
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2013
fDate
7-10 Dec. 2013
Firstpage
1247
Lastpage
1252
Abstract
Online social networks are mainly characterized by large-scale and heterogeneous semantic relationships. Unfortunately, for online social network services such as Facebook or Twitter, it is very difficult to obtain the fully observed network without privilege to access the data internally. To address the above needs, social network sampling is a means that aims at identifying a representative subgraph that preserves certain properties of the network, given the information of any instance in the network is unknown before being sampled. This study tackles heterogeneous network sampling by considering the conditional dependency of node types and link types, where we design a property, Relational Profile, to account such characterization. We further propose a sampling method to preserve this property. Lastly, we propose to evaluate our model from three different angles. First, we show that the proposed sampling method can more faithfully preserve the Relational Profile. Second, we evaluate the usefulness of the Relational Profile showing such information is beneficial for link prediction tasks. Finally, we evaluate whether the networks sampled by our method can be used to train more accurate prediction models comparing to networks produced by other methods.
Keywords
graph theory; sampling methods; social networking (online); Facebook; Twitter; heterogeneous network sampling; heterogeneous semantic relationships; large-scale semantic relationships; link prediction tasks; link type conditional dependency; node type conditional dependency; online social network services; relational profile; representative subgraph identification; social network sampling; Equations; Mathematical model; Patents; Predictive models; Sampling methods; Semantics; Social network services; heterogeneous networks; network prediction; network sampling;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location
Dallas, TX
ISSN
1550-4786
Type
conf
DOI
10.1109/ICDM.2013.102
Filename
6729629
Link To Document