• DocumentCode
    2394874
  • Title
    An efficient partition-based parallel PageRank algorithm
  • Author
    Manaskasemsak, Bundit ; Rungsawang, Arnon
  • Author_Institution
    Massive Inf. & Knowledge Eng., Kasetsart Univ., Bangkok, Thailand
  • Volume
    1
  • fYear
    2005
  • fDate
    20-22 July 2005
  • Firstpage
    257
  • Abstract
    PageRank has become the best-known technique for re-ranking search results. Because it is iterative, the computation consumes considerable time and resources. Researchers have therefore devoted much attention to efficient ways of computing the PageRank scores of a very large Web graph. However, only a few of them focus on large-scale PageRank computation using parallel processing techniques. In this paper, we propose a partition-based parallel PageRank algorithm that runs efficiently on a low-cost parallel environment such as a PC cluster. For comparison, we also study two other known techniques and provide an analytical discussion of I/O cost, synchronization cost, and memory usage. Experimental results on two Web graphs, synthesized from the .TH domain and the Stanford WebBase project, are very promising.
  • Keywords
    Internet; information retrieval; parallel algorithms; workstation clusters; PC cluster; Stanford WebBase project; TH domain; Web graph; iterative computational nature; parallel processing technique; partition based parallel PageRank algorithm; reranking technique; Clustering algorithms; Computer architecture; Concurrent computing; Costs; Iterative algorithms; Knowledge engineering; Large-scale systems; Parallel processing; Partitioning algorithms; Web sites;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Systems, 2005. Proceedings. 11th International Conference on
  • ISSN
    1521-9097
  • Print_ISBN
    0-7695-2281-5
  • Type
    conf
  • DOI
    10.1109/ICPADS.2005.85
  • Filename
    1531136