Title :
Analysis of the Reasons Why Invisible Web Can´t Be Seen and its Effective Retrieval Strategies
Author :
Zhang, Xia ; Zuo, Mingzhang ; Liu, Qiang
Author_Institution :
Dept. of Inf. Technol., Central China Normal Univ., Wuhan
Abstract :
Nowadays, Internet goes deep into human beings\´ daily routines and brings us many remarkable conflicts between mass digital information and our limited capabilities of acquiring them. Search engine thus becomes an important channel of obtaining information. But conventional search engines can index less than 16% of the publicly indexable Webs, and the other 84% are "invisible". Moreover, the public information on the Invisible Web is 400-550 times larger than WWW. Thus users are searching only 0.03% of available pages. Therefore, it\´s urgent to seek efficient methods to solve the retrieval problem of Invisible Web. This thesis elaborates the reasons why Invisible Web is invisible, puts forward several effective strategies of hunting up invisible resources, and expects to bring human beings enlightenment in network information retrieval and utilization.
Keywords :
Internet; information retrieval; search engines; Internet; Web index; World Wide Web; invisible Web; mass digital information; network information retrieval; public information; search engine; Data mining; Humans; Information analysis; Information retrieval; Internet; Physics; Relational databases; Sea surface; Search engines; Web pages;
Conference_Titel :
Innovative Computing Information and Control, 2008. ICICIC '08. 3rd International Conference on
Conference_Location :
Dalian, Liaoning
Print_ISBN :
978-0-7695-3161-8
Electronic_ISBN :
978-0-7695-3161-8
DOI :
10.1109/ICICIC.2008.629