Title :
Intelligent Web Mining Model to Enhance Knowledge Discovery on the Web
Author :
Pandey, Sunil Kr ; Mishra, R.B.
Author_Institution :
Dept. of Comput. Sci., SMS, Varanasi
Abstract :
The large size and the dynamic nature of the Web highlight the need for continuous support and updating of Web based information retrieval systems. Crawlers facilitate the process by following the hyperlinks in Web pages to automatically download a partial snapshot of the Web. This paper describes some details about the architecture of a fully implemented a multiagent Web search system I-Spider for the Internet. Its architecture is based on autonomous software agents and the paper is focused on the communication among them. The overall system architecture is based on a multi-agent paradigm. Agents collaborate together HTML pages from the World Wide Web and treat them in order to be able to retrieve those pages from subsequent users´ queries. Crawling Agent collaboration is required in order to decide the URLs that should be first retrieved. Subsequent page treatment consists on first filtering the pages so that HTML format is transformed into XML and second indexing them so that information retrieval can be performed online
Keywords :
Web sites; data mining; information retrieval; multi-agent systems; search engines; software agents; HTML page; I-Spider; Internet; Web based information retrieval system; Web page; World Wide Web; XML; autonomous software agent; communicating agents; crawling agent collaboration; hyperlinks; indexing; intelligent Web mining; knowledge discovery; multiagent Web search system; search engine; system architecture; Collaboration; Computer architecture; Crawlers; HTML; Information retrieval; Intelligent agent; Service oriented architecture; Web mining; Web pages; Web search; Communicating Agents; Information Retrieval; Multi-Agent Architecture.; Search Engine;
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies, 2006. PDCAT '06. Seventh International Conference on
Conference_Location :
Taipei
Print_ISBN :
0-7695-2736-1
DOI :
10.1109/PDCAT.2006.74