DocumentCode
2185103
Title
Webpage importance analysis using conditional Markov random walk
Author
Liu, Tie-Yan ; Ma, Wei-Ying
Author_Institution
Microsoft Res. Asia, Beijing, China
fYear
2005
fDate
19-22 Sept. 2005
Firstpage
515
Lastpage
521
Abstract
In this paper, we propose a novel method to calculate the Web page importance based on a conditional Markov random walk model. The main assumption in this model is that given the hyperlinks in a Web page, users are not really randomly clicking one of them. Instead, many factors may bias their behaviors, for example, the anchor text, the content relevance and the previous experiences when visiting the Web site that a destination page belongs to. As one of the results, the user might tend to visit those pages in high-quality Web sites with higher probability. To implement this idea, we reformulate the Web graph to be a two-layer structure, and the Web page importance is calculated by conditional random walk in this new Web graph. Experiments on the topic distillation task of TREC 2003 Web track showed that our new method can achieve about 18% improvement on mean average precision (MAP) and 16% on precision at 10 (P@10) over the PageRank algorithm.
Keywords
Markov processes; Web sites; graph theory; random processes; Web graph; Web page importance analysis; conditional Markov random walk; Algorithm design and analysis; Asia; Information retrieval; Internet; Investments; Mining industry; Oceans; Search engines; Web pages; Web search;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2415-X
Type
conf
DOI
10.1109/WI.2005.161
Filename
1517902
Link To Document