DocumentCode :
3063469
Title :
The Crawler of Specific Resources Recognition Based on Multi-thread
Author :
Ke, Ming ; Zhang, PengZhou ; Chen, Guowei
Author_Institution :
New Media Inst., Commun. Univ. of China, Beijing, China
fYear :
2012
fDate :
23-26 June 2012
Firstpage :
569
Lastpage :
572
Abstract :
With the development of computer network and widely used of Internet, online information increases in broadband level exponentially, the difficulty and complexity of information retrieval also increase gradually, so the Crawler is developing rapidly. Crawler is a program that can auto collect information from internet. In this paper, we design and implement a multi-thread Crawler for specific resources. This Crawler has features of high accuracy, strong adaptability and high efficiency. Experiment results prove these.
Keywords :
Internet; information retrieval; search engines; Internet; broadband level; computer network; information retrieval; multithread Crawler; specific resources recognition; Accuracy; Crawlers; Data mining; Databases; Educational institutions; Instruction sets; Knowledge engineering; URL filtering; crawler; information extraction; multithreads;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Sciences and Optimization (CSO), 2012 Fifth International Joint Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4673-1365-0
Type :
conf
DOI :
10.1109/CSO.2012.130
Filename :
6274791
Link To Document :
بازگشت