DocumentCode :
3168360
Title :
An efficient adaptive focused crawler based on ontology learning
Author :
Su, Chang ; Gao, Yang ; Yang, Jianmei ; Luo, Bin
Author_Institution :
State Key Lab. for Novel Software Technol., Nanjing Univ., China
fYear :
2005
fDate :
6-9 Nov. 2005
Abstract :
The enormous growth of the World Wide Web has made it important to perform resource discovery efficiently. Consequently, several new ideas have been proposed; among them a key technique is focused crawling which is able to crawl particular topical portions of the World Wide Web quickly without having to explore all Web pages. In this paper, we present an intelligent focused crawler algorithm in which we embed ontology to evaluate the page´s relevance to the topic. Compared with other algorithms using domain knowledge, our algorithm can evolve the ontology automatically during crawl process. Considering the instinct characteristics of the ontology, propagation has also been imported to accelerate the evolution of the ontology. We applied our approaches in several tasks and provided an empirical evaluation which has shown promising results.
Keywords :
Internet; information retrieval; learning (artificial intelligence); ontologies (artificial intelligence); World Wide Web; adaptive focused crawler; ontology learning; resource discovery; Acceleration; Crawlers; Laboratories; Learning systems; Ontologies; Search engines; Software performance; Web pages; Web search; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Hybrid Intelligent Systems, 2005. HIS '05. Fifth International Conference on
Print_ISBN :
0-7695-2457-5
Type :
conf
DOI :
10.1109/ICHIS.2005.19
Filename :
1587729
Link To Document :
بازگشت