DocumentCode
695364
Title
Robustness of Network Centrality Metrics in the Context of Digital Communication Data
Author
Ju-Sung Lee ; Pfeffer, Juergen
fYear
2015
fDate
5-8 Jan. 2015
Firstpage
1798
Lastpage
1807
Abstract
Social media data and other web-based network data are large and dynamic rendering the identification of structural changes in such systems a hard problem. Typically, online data is constantly streaming and results in data that is incomplete thus necessitating the need to understand the robustness of network metrics on partial or sampled network data. In this paper, we examine the effects of sampling on key network centrality metrics using two empirical communication datasets. Correlations between network metrics of original and sampled nodes offer a measure of sampling accuracy. The relationship between sampling and accuracy is convergent and amenable to nonlinear analysis. Naturally, larger edge samples induce sampled graphs that are more representative of the original graph. However, this effect is attenuated when larger sets of nodes are recovered in the samples. Also, we find that the graph structure plays a prominent role in sampling accuracy. Centralized graphs, in which fewer nodes enjoy higher centrality scores, offer more representative samples.
Keywords
Internet; digital communication; graph theory; social networking (online); Web-based network data; centralized graphs; digital communication data; empirical communication datasets; key network centrality metrics; network metrics; nonlinear analysis; sampled graphs; social media data; Accuracy; Correlation; Electronic mail; Measurement; Robustness; Standards; Twitter; digital communication; network analysis; sampling;
fLanguage
English
Publisher
ieee
Conference_Titel
System Sciences (HICSS), 2015 48th Hawaii International Conference on
Conference_Location
Kauai, HI
ISSN
1530-1605
Type
conf
DOI
10.1109/HICSS.2015.217
Filename
7070028
Link To Document