Title :
An efficient partition-based parallel PageRank algorithm
Author :
Manaskasemsak, Bundit ; Rungsawang, Arnon
Author_Institution :
Massive Inf. & Knowledge Eng., Kasetsart Univ., Bangkok, Thailand
Abstract :
PageRank becomes the most well-known re-ranking technique of the search results. By its iterative computational nature, the computation takes much computing time and resource. Researchers have then devoted much attention in studying an efficient way to compute the PageRank scores of a very large Web graph. However, only a few of them focus on large-scale PageRank computation using parallel processing techniques. In this paper, we propose a partition-based parallel PageRank algorithm that can efficiently run on a low-cost parallel environment like the PC cluster. For comparison, we also study the other two known techniques, as well as propose an analytical discussion concerning I/O and synchronization cost, and memory usage. Experimental results with two Web graphs synthesized from the .TH domain and the Stanford WebBase project are very promising.
Keywords :
Internet; information retrieval; parallel algorithms; workstation clusters; PC cluster; Stanford WebBase project; TH domain; Web graph; iterative computational nature; parallel processing technique; partition based parallel PageRank algorithm; reranking technique; Clustering algorithms; Computer architecture; Concurrent computing; Costs; Iterative algorithms; Knowledge engineering; Large-scale systems; Parallel processing; Partitioning algorithms; Web sites;
Conference_Titel :
Parallel and Distributed Systems, 2005. Proceedings. 11th International Conference on
Print_ISBN :
0-7695-2281-5
DOI :
10.1109/ICPADS.2005.85