DocumentCode :
1574073
Title :
Intelligent spider for Internet searching
Author :
Chen, Hsinchun ; Chung, Yi-Ming ; Ramsey, Marshall ; Yang, Christopher C. ; Ma, Pai-Chun ; Yen, Jerome
Author_Institution :
Arizona Univ., Tucson, AZ, USA
Volume :
4
fYear :
1997
Firstpage :
178
Abstract :
As World Wide Web (WWW) based Internet services become more popular, information overload also becomes a pressing research problem. Difficulties with searching on the Internet get worse as the amount of information that is available increases. A scalable approach to support Internet search is critical to the success of Internet services and other current or future national information infrastructure (NII) applications. A new approach to build an intelligent personal spider (agent), which is based on automatic textual analysis of Internet documents, is proposed. Best first search and genetic algorithm have been tested to develop the intelligent spider. These personal spiders are able to dynamically and intelligently analyze the contents of the users´ selected homepages as the starting point to search for the most relevant homepages based on the links and indexing. An intelligent spider must have the capability to make adjustments according to progress of searching in order to be an intelligent agent. However, the current searching engines do not have communication between the users and the robots. The spider presented in the paper uses Java to develop the user interface such that the users can adjust the control parameters according to the progress and observe the intermediate results. The performances of the genetic algorithm based and best first search based spiders are also reported
Keywords :
Internet; information retrieval; online front-ends; software agents; word processing; Internet documents; Internet searching; Internet services; Java; World Wide Web based Internet services; automatic textual analysis; best first search; control parameters; genetic algorithm; homepages; information overload; intelligent agent; intelligent personal spider; national information infrastructure; scalable approach; searching engines; user interface; Genetic algorithms; Indexing; Intelligent agent; Intelligent robots; Pressing; Testing; Text analysis; Web and internet services; Web sites; World Wide Web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 1997, Proceedings of the Thirtieth Hawaii International Conference on
Conference_Location :
Wailea, HI
ISSN :
1060-3425
Print_ISBN :
0-8186-7743-0
Type :
conf
DOI :
10.1109/HICSS.1997.663379
Filename :
663379
Link To Document :
بازگشت