DocumentCode :
140793
Title :
Fast incremental SimRank on link-evolving graphs
Author :
Weiren Yu ; Xuemin Lin ; Wenjie Zhang
Author_Institution :
Univ. of New South Wales, Sydney, NSW, Australia
fYear :
2014
fDate :
March 31 2014-April 4 2014
Firstpage :
304
Lastpage :
315
Abstract :
SimRank is an arresting measure of node-pair similarity based on hyperlinks. It iteratively follows the concept that 2 nodes are similar if they are referenced by similar nodes. Real graphs are often large, and links constantly evolve with small changes over time. This paper considers fast incremental computations of SimRank on link-evolving graphs. The prior approach [12] to this issue factorizes the graph via a singular value decomposition (SVD) first, and then incrementally maintains this factorization for link updates at the expense of exactness. Consequently, all node-pair similarities are estimated in O(r4n2) time on a graph of n nodes, where r is the target rank of the low-rank approximation, which is not negligibly small in practice. In this paper, we propose a novel fast incremental paradigm. (1) We characterize the SimRank update matrix ΔS, in response to every link update, via a rank-one Sylvester matrix equation. By virtue of this, we devise a fast incremental algorithm computing similarities of n2 node-pairs in O(Kn2) time for K iterations. (2) We also propose an effective pruning technique capturing the “affected areas” of ΔS to skip unnecessary computations, without loss of exactness. This can further accelerate the incremental SimRank computation to O(K(nd+|AFF|)) time, where d is the average in-degree of the old graph, and |AFF| (≤ n2) is the size of “affected areas” in ΔS, and in practice, |AFF| ≪ n2. Our empirical evaluations verify that our algorithm (a) outperforms the best known link-update algorithm [12], and (b) runs much faster than its batch counterpart when link updates are small.
Keywords :
approximation theory; computational complexity; graph theory; information retrieval; learning (artificial intelligence); singular value decomposition; O(Kn2) time; O(r4n2) time estimation; SVD; SimRank update matrix; fast incremental SimRank; fast incremental algorithm; fast incremental paradigm; graph factorization; hyperlinks; link updates; link-evolving graphs; link-update algorithm; low-rank approximation; node-pair similarities; node-pair similarity measure; pruning technique; rank-one Sylvester matrix equation; singular value decomposition; Accuracy; Approximation methods; Equations; Heuristic algorithms; Matrix converters; Matrix decomposition; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/ICDE.2014.6816660
Filename :
6816660
Link To Document :
بازگشت