DocumentCode
2935993
Title
Crawling web pages with application in online advertises monitoring system
Author
Xie Zhengao ; Su Shoubao ; Xu Huali
Author_Institution
Sch. of Software, Univ. of Sci. & Technol. of China, Hefei, China
Volume
2
fYear
2010
fDate
1-2 Aug. 2010
Firstpage
157
Lastpage
160
Abstract
Due to the forms and features of online advertising, an effective web crawling page method, called `Spider´, is designed and implemented by analyzing the information carriers and script codes of web pages. Drawing on the basis of the search engine techniques, a row of heavy method is proposed by employing the preemptive multi-threading technique. It is used to solve the excessive consumption of system resources and network bandwidth in search on the Internet with the Spider to `crawl´ the duplication of information downloaded.
Keywords
Internet; advertising data processing; information retrieval; multi-threading; search engines; Internet; Spider; Web crawling page method; multithreading technique; online advertise monitoring system; search engine techniques; HTML; World Wide Web; Internet; Spider; crawling web pages; online advertising;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits,Communications and System (PACCS), 2010 Second Pacific-Asia Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-7969-6
Type
conf
DOI
10.1109/PACCS.2010.5627009
Filename
5627009
Link To Document