DocumentCode :
2935993
Title :
Crawling web pages with application in online advertises monitoring system
Author :
Xie Zhengao ; Su Shoubao ; Xu Huali
Author_Institution :
Sch. of Software, Univ. of Sci. & Technol. of China, Hefei, China
Volume :
2
fYear :
2010
fDate :
1-2 Aug. 2010
Firstpage :
157
Lastpage :
160
Abstract :
Given the forms and features of online advertising, an effective web page crawling method, called 'Spider', is designed and implemented by analyzing the information carriers and script code of web pages. Building on search engine techniques, a duplicate-elimination method is proposed that employs preemptive multi-threading. It addresses the excessive consumption of system resources and network bandwidth that arises when the Spider repeatedly downloads duplicate information while crawling the Internet.
Keywords :
Internet; advertising data processing; information retrieval; multi-threading; search engines; Spider; Web crawling page method; multithreading technique; online advertise monitoring system; search engine techniques; HTML; World Wide Web; crawling web pages; online advertising
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Circuits, Communications and System (PACCS), 2010 Second Pacific-Asia Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-7969-6
Type :
conf
DOI :
10.1109/PACCS.2010.5627009
Filename :
5627009