DocumentCode :
3603563
Title :
VEGAS: Visual influEnce GrAph Summarization on Citation Networks
Author :
Lei Shi ; Hanghang Tong ; Jie Tang ; Chuang Lin
Author_Institution :
State Key Lab. of Comput. Sci., Inst. of Software, Beijing, China
Volume :
27
Issue :
12
fYear :
2015
Firstpage :
3417
Lastpage :
3431
Abstract :
Visually analyzing citation networks poses challenges to many fields of the data mining research. How can we summarize a large citation graph according to the user´s interest? In particular, how can we illustrate the impact of a highly influential paper through the summarization? Can we maintain the sensory node-link graph structure while revealing the flow-based influence patterns and preserving a fine readability? The state-of-the-art influence maximization algorithms can detect the most influential node in a citation network, but fail to summarize a graph structure to account for its influence. On the other hand, existing graph summarization methods fold large graphs into clustered views, but can not reveal the hidden influence patterns underneath the citation network. In this paper, we first formally define the Influence Graph Summarization problem on citation networks. Second, we propose a matrix decomposition based algorithm pipeline to solve the IGS problem. Our method can not only highlight the flow-based influence patterns, but also easily extend to support the rich attribute information. A prototype system called VEGAS implementing this pipeline is also developed. Third, we present a theoretical analysis on our main algorithm, which is equivalent to the kernel k-mean clustering. It can be proved that the matrix decomposition based algorithm can approximate the objective of the proposed IGS problem. Last, we conduct comprehensive experiments with real-world citation networks to compare the proposed algorithm with classical graph summarization methods. Evaluation results demonstrate that our method significantly outperforms the previous ones in optimizing both the quantitative IGS objective and the quality of the visual summarizations.
Keywords :
citation analysis; data mining; data visualisation; matrix decomposition; pattern clustering; IGS problem; VEGAS; citation networks; data mining; flow-based influence patterns; influence maximization algorithms; kernel k-mean clustering; large citation graph summarization; matrix decomposition based algorithm; rich attribute information; sensory node-link graph structure; visual influence graph summarization method; Algorithm design and analysis; Approximation algorithms; Citation analysis; Clustering algorithms; Matrix decomposition; Influence summarization; citation network; influence summarization; visualization;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2015.2453957
Filename :
7152908
Link To Document :
بازگشت