DocumentCode
2928001
Title
Intelligent Web Mining Model to Enhance Knowledge Discovery on the Web
Author
Pandey, Sunil Kr ; Mishra, R.B.
Author_Institution
Dept. of Comput. Sci., SMS, Varanasi
fYear
2006
fDate
Dec. 2006
Firstpage
339
Lastpage
343
Abstract
The large size and the dynamic nature of the Web highlight the need for continuous support and updating of Web based information retrieval systems. Crawlers facilitate the process by following the hyperlinks in Web pages to automatically download a partial snapshot of the Web. This paper describes some details about the architecture of a fully implemented a multiagent Web search system I-Spider for the Internet. Its architecture is based on autonomous software agents and the paper is focused on the communication among them. The overall system architecture is based on a multi-agent paradigm. Agents collaborate together HTML pages from the World Wide Web and treat them in order to be able to retrieve those pages from subsequent users´ queries. Crawling Agent collaboration is required in order to decide the URLs that should be first retrieved. Subsequent page treatment consists on first filtering the pages so that HTML format is transformed into XML and second indexing them so that information retrieval can be performed online
Keywords
Web sites; data mining; information retrieval; multi-agent systems; search engines; software agents; HTML page; I-Spider; Internet; Web based information retrieval system; Web page; World Wide Web; XML; autonomous software agent; communicating agents; crawling agent collaboration; hyperlinks; indexing; intelligent Web mining; knowledge discovery; multiagent Web search system; search engine; system architecture; Collaboration; Computer architecture; Crawlers; HTML; Information retrieval; Intelligent agent; Service oriented architecture; Web mining; Web pages; Web search; Communicating Agents; Information Retrieval; Multi-Agent Architecture.; Search Engine;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Computing, Applications and Technologies, 2006. PDCAT '06. Seventh International Conference on
Conference_Location
Taipei
Print_ISBN
0-7695-2736-1
Type
conf
DOI
10.1109/PDCAT.2006.74
Filename
4032203
Link To Document