DocumentCode :
2394874
Title :
An efficient partition-based parallel PageRank algorithm
Author :
Manaskasemsak, Bundit ; Rungsawang, Arnon
Author_Institution :
Massive Inf. & Knowledge Eng., Kasetsart Univ., Bangkok, Thailand
Volume :
1
fYear :
2005
fDate :
20-22 July 2005
Firstpage :
257
Abstract :
PageRank has become the best-known technique for re-ranking search results. Because it is iterative, computing it consumes considerable time and resources. Researchers have therefore devoted much attention to efficient ways of computing the PageRank scores of very large Web graphs. However, only a few have focused on large-scale PageRank computation using parallel processing techniques. In this paper, we propose a partition-based parallel PageRank algorithm that runs efficiently on a low-cost parallel environment such as a PC cluster. For comparison, we also study two other known techniques and provide an analytical discussion of I/O cost, synchronization cost, and memory usage. Experimental results on two Web graphs, synthesized from the .TH domain and the Stanford WebBase project, are very promising.
Keywords :
Internet; information retrieval; parallel algorithms; workstation clusters; PC cluster; Stanford WebBase project; TH domain; Web graph; iterative computational nature; parallel processing technique; partition based parallel PageRank algorithm; reranking technique; Clustering algorithms; Computer architecture; Concurrent computing; Costs; Iterative algorithms; Knowledge engineering; Large-scale systems; Parallel processing; Partitioning algorithms; Web sites;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel and Distributed Systems, 2005. Proceedings. 11th International Conference on
ISSN :
1521-9097
Print_ISBN :
0-7695-2281-5
Type :
conf
DOI :
10.1109/ICPADS.2005.85
Filename :
1531136