DocumentCode :
2772257
Title :
Efficient Algorithm for Computing Link-Based Similarity in Real World Networks
Author :
Cai, Yuanzhe ; Cong, Gao ; Jia, Xu ; Liu, Hongyan ; He, Jun ; Lu, Jiaheng ; Du, Xiaoyong
Author_Institution :
Key Labs. of Data Eng. & Knowledge Eng., Minist. of Educ., China
fYear :
2009
fDate :
6-9 Dec. 2009
Firstpage :
734
Lastpage :
739
Abstract :
Similarity calculation has many applications, such as information retrieval, and collaborative filtering, among many others. It has been shown that link-based similarity measure, such as SimRank, is very effective in characterizing the object similarities in networks, such as the Web, by exploiting the object-to-object relationship. Unfortunately, it is prohibitively expensive to compute the link-based similarity in a relatively large graph. In this paper, based on the observation that link-based similarity scores of real world graphs follow the power-law distribution, we propose a new approximate algorithm, namely Power-SimRank, with guaranteed error bound to efficiently compute link-based similarity measure. We also prove the convergence of the proposed algorithm. Extensive experiments conducted on real world datasets and synthetic datasets show that the proposed algorithm outperforms SimRank by four-five times in terms of efficiency while the error generated by the approximation is small.
Keywords :
Internet; data handling; information filtering; Power-SimRank; Web; collaborative filtering; information retrieval; link-based similarity scores; object-to-object relationship; real world networks; synthetic datasets; Collaboration; Computer networks; Computer science; Computer science education; Data engineering; Data mining; Filtering; Helium; Iterative algorithms; Knowledge engineering; Graph Mining; SimRank; Similarity Calculation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
ISSN :
1550-4786
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2009.136
Filename :
5360303
Link To Document :
بازگشت