• DocumentCode
    2641295
  • Title

    Analysis of the Reasons Why Invisible Web Can´t Be Seen and its Effective Retrieval Strategies

  • Author

    Zhang, Xia ; Zuo, Mingzhang ; Liu, Qiang

  • Author_Institution
    Dept. of Inf. Technol., Central China Normal Univ., Wuhan
  • fYear
    2008
  • fDate
    18-20 June 2008
  • Firstpage
    563
  • Lastpage
    563
  • Abstract
    Nowadays, Internet goes deep into human beings\´ daily routines and brings us many remarkable conflicts between mass digital information and our limited capabilities of acquiring them. Search engine thus becomes an important channel of obtaining information. But conventional search engines can index less than 16% of the publicly indexable Webs, and the other 84% are "invisible". Moreover, the public information on the Invisible Web is 400-550 times larger than WWW. Thus users are searching only 0.03% of available pages. Therefore, it\´s urgent to seek efficient methods to solve the retrieval problem of Invisible Web. This thesis elaborates the reasons why Invisible Web is invisible, puts forward several effective strategies of hunting up invisible resources, and expects to bring human beings enlightenment in network information retrieval and utilization.
  • Keywords
    Internet; information retrieval; search engines; Internet; Web index; World Wide Web; invisible Web; mass digital information; network information retrieval; public information; search engine; Data mining; Humans; Information analysis; Information retrieval; Internet; Physics; Relational databases; Sea surface; Search engines; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovative Computing Information and Control, 2008. ICICIC '08. 3rd International Conference on
  • Conference_Location
    Dalian, Liaoning
  • Print_ISBN
    978-0-7695-3161-8
  • Electronic_ISBN
    978-0-7695-3161-8
  • Type

    conf

  • DOI
    10.1109/ICICIC.2008.629
  • Filename
    4603752