DocumentCode
2394874
Title
An efficient partition-based parallel PageRank algorithm
Author
Manaskasemsak, Bundit ; Rungsawang, Arnon
Author_Institution
Massive Inf. & Knowledge Eng., Kasetsart Univ., Bangkok, Thailand
Volume
1
fYear
2005
fDate
20-22 July 2005
Firstpage
257
Abstract
PageRank becomes the most well-known re-ranking technique of the search results. By its iterative computational nature, the computation takes much computing time and resource. Researchers have then devoted much attention in studying an efficient way to compute the PageRank scores of a very large Web graph. However, only a few of them focus on large-scale PageRank computation using parallel processing techniques. In this paper, we propose a partition-based parallel PageRank algorithm that can efficiently run on a low-cost parallel environment like the PC cluster. For comparison, we also study the other two known techniques, as well as propose an analytical discussion concerning I/O and synchronization cost, and memory usage. Experimental results with two Web graphs synthesized from the .TH domain and the Stanford WebBase project are very promising.
Keywords
Internet; information retrieval; parallel algorithms; workstation clusters; PC cluster; Stanford WebBase project; TH domain; Web graph; iterative computational nature; parallel processing technique; partition based parallel PageRank algorithm; reranking technique; Clustering algorithms; Computer architecture; Concurrent computing; Costs; Iterative algorithms; Knowledge engineering; Large-scale systems; Parallel processing; Partitioning algorithms; Web sites;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems, 2005. Proceedings. 11th International Conference on
ISSN
1521-9097
Print_ISBN
0-7695-2281-5
Type
conf
DOI
10.1109/ICPADS.2005.85
Filename
1531136
Link To Document