Title :
The URL Search Strategy Based on the Content and Link Analysis
Author :
Zhou, CaiLan ; Sun, Xuan ; Guo, Hongjie
Author_Institution :
Coll. of Comput. Sci. & Technol., WHUT, Wuhan, China
Abstract :
The Web information which influences the topic relevance of URL is analyzed based on the research of the search strategy about the crawler. On this basis, a new URL search algorithm based on the content and link analysis is supplied to us. The experimental results show that the algorithm not only can solve the problem of topic isolated island to increase recall, but also can avoid the phenomenon of the topic drift at the same time.
Keywords :
Internet; content management; query formulation; relevance feedback; search engines; URL search strategy; Web information; content analysis; crawler; link analysis; topic isolated island problem; topic relevance; Algorithm design and analysis; Computer science; Crawlers; Educational institutions; Finance; Information analysis; Internet; Sun; Uniform resource locators; Web pages;
Conference_Titel :
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4507-3
Electronic_ISBN :
978-1-4244-4507-3
DOI :
10.1109/CISE.2009.5364502