• DocumentCode
    2159997
  • Title

    An approach for accessing data from hidden web using intelligent agent technology

  • Author

    Singh, Lavneet ; Sharma, D.K.

  • Author_Institution
    Dept. of CEA, GLA Univ., Mathura, India
  • fYear
    2013
  • fDate
    22-23 Feb. 2013
  • Firstpage
    800
  • Lastpage
    805
  • Abstract
    There is large amount of information available on web, which is hidden from users. This is because such information is not able to be accessed or indexed by traditional search engines. These search engines are only able to crawl information by following hypertext links. The forms which require login or any authorization process can be ignored by them. Hidden web refers to that deepest part of the Web which is not available for traditional Web crawlers. Obtaining the content from Hidden web is a challenging task. Today many web sites are containing pages that are dynamic in nature. This dynamic nature of web pages creates a problem for retrieving information for traditional web crawlers. The effort done to solve the given problem is discussed in brief. Then, a comparative study among the earlier defined architecture, considering various parameters, is also shown. By analyzing above methods a framework is proposed which uses an intelligent agent technology for accessing the hidden web.
  • Keywords
    Web sites; authorisation; hypermedia; indexing; information retrieval; search engines; software agents; Web crawlers; Web sites; authorization process; data access approach; hidden Web; hypertext links; information crawling; information retrieval problem; intelligent agent technology; search engines; Crawlers; Databases; Feature extraction; Filling; Intelligent agents; Learning (artificial intelligence); Search engines; Hidden Web; Hidden Web Crawling; Hidden Web Databases; Intelligent Agent Technology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advance Computing Conference (IACC), 2013 IEEE 3rd International
  • Conference_Location
    Ghaziabad
  • Print_ISBN
    978-1-4673-4527-9
  • Type

    conf

  • DOI
    10.1109/IAdCC.2013.6514329
  • Filename
    6514329