DocumentCode
3603563
Title
VEGAS: Visual influEnce GrAph Summarization on Citation Networks
Author
Lei Shi ; Hanghang Tong ; Jie Tang ; Chuang Lin
Author_Institution
State Key Lab. of Comput. Sci., Inst. of Software, Beijing, China
Volume
27
Issue
12
fYear
2015
Firstpage
3417
Lastpage
3431
Abstract
Visually analyzing citation networks poses challenges to many fields of the data mining research. How can we summarize a large citation graph according to the user´s interest? In particular, how can we illustrate the impact of a highly influential paper through the summarization? Can we maintain the sensory node-link graph structure while revealing the flow-based influence patterns and preserving a fine readability? The state-of-the-art influence maximization algorithms can detect the most influential node in a citation network, but fail to summarize a graph structure to account for its influence. On the other hand, existing graph summarization methods fold large graphs into clustered views, but can not reveal the hidden influence patterns underneath the citation network. In this paper, we first formally define the Influence Graph Summarization problem on citation networks. Second, we propose a matrix decomposition based algorithm pipeline to solve the IGS problem. Our method can not only highlight the flow-based influence patterns, but also easily extend to support the rich attribute information. A prototype system called VEGAS implementing this pipeline is also developed. Third, we present a theoretical analysis on our main algorithm, which is equivalent to the kernel k-mean clustering. It can be proved that the matrix decomposition based algorithm can approximate the objective of the proposed IGS problem. Last, we conduct comprehensive experiments with real-world citation networks to compare the proposed algorithm with classical graph summarization methods. Evaluation results demonstrate that our method significantly outperforms the previous ones in optimizing both the quantitative IGS objective and the quality of the visual summarizations.
Keywords
citation analysis; data mining; data visualisation; matrix decomposition; pattern clustering; IGS problem; VEGAS; citation networks; data mining; flow-based influence patterns; influence maximization algorithms; kernel k-mean clustering; large citation graph summarization; matrix decomposition based algorithm; rich attribute information; sensory node-link graph structure; visual influence graph summarization method; Algorithm design and analysis; Approximation algorithms; Citation analysis; Clustering algorithms; Matrix decomposition; Influence summarization; citation network; influence summarization; visualization;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2015.2453957
Filename
7152908
Link To Document