Title :
A data-mining approach for optimizing performance of an incremental crawler
Author :
Bullot, Hadrien ; Gupta, S.K. ; Mohania, M.K.
Author_Institution :
Sch. of Comput. & Commun. Sci., Swiss Fed. Inst. of Technol., Lausanne, Switzerland
Abstract :
Crawlers visit the Web to keep a local repository of Web pages up to date. We introduce another perspective on building an effective incremental crawler. Building on previous work in this field, we study how data mining can improve the performance of a crawler. Information collected from users can help the crawler identify which pages are popular and revisit them as soon as possible.
Keywords :
Internet; data mining; optimisation; search engines; Web pages; incremental crawler performance optimization; Bandwidth; Computer science; Crawlers; Data analysis; Databases; Uniform resource locators; World Wide Web
Conference_Title :
Web Intelligence, 2003. WI 2003. Proceedings. IEEE/WIC International Conference on
Print_ISBN :
0-7695-1932-6
DOI :
10.1109/WI.2003.1241279