DocumentCode
2754311
Title
TC-PageRank Algorithm Based on Topic Correlation
Author
Huang, Decai ; Qi, Huachun ; Yuan, Yuan ; Zheng, Yue-feng
Author_Institution
Coll. of Inf. Eng., Zhejiang Univ. of Technol., Hangzhou
Volume
2
fYear
2006
fDate
0-0 0
Firstpage
5943
Lastpage
5946
Abstract
PageRank is a well-known algorithm for mining Web structure, but it suffers from topic-drift. To eliminate this topic-drift, and following an analysis of existing algorithms, a new algorithm called TC-PageRank is proposed. TC-PageRank is based on a fictitious file vector and the cosine correlation measure. Experimental results show that TC-PageRank effectively eliminates the topic-drift phenomenon and thus improves retrieval quality.
Keywords
Internet; correlation methods; data mining; information retrieval; TC-PageRank; Web structure mining; cosine correlation measure; fictitious file vector; topic correlation; topic-drift drawback; Algorithm design and analysis; Automobiles; Classification algorithms; Educational institutions; Prototypes; Search engines; Turning; Web sites; Hyperlink Analysis; PageRank Algorithm; Topic Correlation; Web Structure Mining
fLanguage
English
Publisher
IEEE
Conference_Titel
Intelligent Control and Automation, 2006. WCICA 2006. The Sixth World Congress on
Conference_Location
Dalian
Print_ISBN
1-4244-0332-4
Type
conf
DOI
10.1109/WCICA.2006.1714219
Filename
1714219