Title :
Research on Prototype Framework of a Multi-Threading Web Crawler for E-Commerce
Author_Institution :
Sch. of Inf. Manage., WuHan Univ., Wuhan, China
Abstract :
Web crawlers facilitate the search engine´s work by following the hyperlinks in Web pages to automatically download a partial snapshot of the Web. Crawling is the initial and also the most important step during the Web searching procedure. A prototype framework of a multi-threading Web crawler for E-commerce application is proposed, in relationship to the former research of search engine. And the design and implementation of a multi-threading Web crawler is described and discussed. The experiment result demonstrates this prototype of Web crawler has better performance.
Keywords :
Internet; electronic commerce; information retrieval; multi-threading; search engines; E-commerce; Web page hyperlinks; multithreading Web crawler; search engine; Costs; Crawlers; Hardware; Information management; Network servers; Prototypes; Search engines; Uniform resource locators; Web pages; Web server;
Conference_Titel :
Management and Service Science, 2009. MASS '09. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4638-4
Electronic_ISBN :
978-1-4244-4639-1
DOI :
10.1109/ICMSS.2009.5304437