DocumentCode :
3423422
Title :
Hyperlink Classification: A New Approach to Improve PageRank
Author :
Cun-He, Li ; Ke-Qiang, Lv
Author_Institution :
China Univ. of Pet., Dongying
fYear :
2007
fDate :
3-7 Sept. 2007
Firstpage :
274
Lastpage :
277
Abstract :
Hyperlink structure is widely used in hypertext classification, but it has not received enough attention. We propose a hyperlink classification approach to improve the PageRank algorithm, which is widely used in search engine link analysis. The cause of the topic drift problem is analyzed, and hyperlinks are classified according to the motivations for creating them and their effects. The improved PageRank algorithm is implemented in the open source search engine NUTCH on the Chinese Internet. The experimental results show that the improved PageRank algorithm performs better than standard PageRank.
Keywords :
Internet; pattern classification; public domain software; search engines; Chinese Internet; NUTCH; PageRank; hyperlink classification; hypertext classification; link analysis; open source search engine; topic drift problem; Algorithm design and analysis; Application software; Cause effect analysis; Data engineering; Databases; Expert systems; Internet; Petroleum; Robustness; Search engines;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Database and Expert Systems Applications, 2007. DEXA '07. 18th International Workshop on
Conference_Location :
Regensburg
ISSN :
1529-4188
Print_ISBN :
978-0-7695-2932-5
Type :
conf
DOI :
10.1109/DEXA.2007.14
Filename :
4312900