DocumentCode
3121403
Title
ApproxRank: Estimating Rank for a Subgraph
Author
Wu, Yao ; Raschid, Louiqa
Author_Institution
Inst. for Adv. Comput. Studies (UMIACS), Univ. of Maryland, Coll. Park, College Park, MD
fYear
2009
fDate
March 29 2009-April 2 2009
Firstpage
54
Lastpage
65
Abstract
Customized semantic query answering, personalized search, focused crawlers and localized search engines frequently focus on ranking the pages contained within a subgraph of the global Web graph. The challenge for these applications is to compute PageRank-style scores efficiently on the subgraph, i.e., the ranking must reflect the global link structure of the Web graph but it must do so without paying the high overhead associated with a global computation. We propose a framework of an exact solution and an approximate solution for computing ranking on a subgraph. The IdealRank algorithm is an exact solution with the assumption that the scores of external pages are known. We prove that the IdealRank scores for pages in the subgraph converge. Since the PageRank-style scores of external pages may not typically be available, we propose the ApproxRank algorithm to estimate scores for the subgraph. Both IdealRank and ApproxRank represent the set of external pages with an external node L1 and extend the subgraph with links to L1. They also modify the PageRank-style transition matrix with respect to L1. We analyze the L1 distance between IdealRank scores and ApproxRank scores of the subgraph and show that it is within a constant factor of the L1 distance of the external pages (e.g., the true PageRank scores and uniform scores assumed by ApproxRank). We compare ApproxRank and a stochastic complementation approach (SC), a current best solution for this problem, on different types of subgraphs. ApproxRank has similar or superior performance to SC and typically improves on the runtime performance of SC by an order of magnitude or better. We demonstrate that ApproxRank provides a good approximation to PageRank for a variety of subgraphs.
Keywords
Internet; graph theory; query processing; search engines; ApproxRank; IdealRank algorithm; PageRank-style; customized semantic query answering; focused crawlers; global Web graph; localized search engines; personalized search; Application software; Computer applications; Crawlers; Data engineering; Educational institutions; Explosions; Runtime; Search engines; Stochastic processes; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location
Shanghai
ISSN
1084-4627
Print_ISBN
978-1-4244-3422-0
Electronic_ISBN
1084-4627
Type
conf
DOI
10.1109/ICDE.2009.108
Filename
4812391
Link To Document