DocumentCode :
401782
Title :
Site-granularity topic distillation on the Web by combining content and hyperlink analysis
Author :
Xu, Zhuo-ming ; Cao, Xiao ; Han, A-hong ; Qu, Yu-zhong ; Dong, Yi-sheng
Author_Institution :
Dept. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
Volume :
4
fYear :
2003
fDate :
2-5 Nov. 2003
Firstpage :
2116
Abstract :
Topic distillation on the Web, namely, given a user query to find quality information sources related to the query topic by using hyperlink analysis, has been shown to be useful in Web IR. Based on the analysis of three deficiencies of classical topic distillation algorithm HITS (i.e., failing to meet users´ site-granularity information needs; tending to produce unreasonable results; topic drift), this paper presents an improved model and algorithm named s-HITSc (site-granularity HITS enhanced by content analysis). Given a query topic, the new algorithm can model a neighborhood graph at site granularity, compute the relevance weights of the nodes to the topic with content analysis, and apply weighted I/O operations in its iterative hyperlink analysis. Theoretical analysis and experimental results show that the new algorithm can control topic drift and identify more reasonable and meaningful authority and hub sites on a given query topic.
Keywords :
Internet; graphs; hypermedia; information retrieval; iterative methods; Web IR; content analysis; hypertext induced topic search algorithm; iterative hyperlink analysis; neighborhood graph; post-retrieval process; preretrieval process; query topic; site-granularity topic distillation; topic drift; Algorithm design and analysis; Computer science; Educational institutions; Electronic mail; Failure analysis; Information analysis; Iterative algorithms; Machine learning; Optical computing; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN :
0-7803-8131-9
Type :
conf
DOI :
10.1109/ICMLC.2003.1259855
Filename :
1259855
Link To Document :
بازگشت