An approach for accessing data from hidden web using intelligent agent technology

Author

Singh, Lavneet ; Sharma, D.K.

Author_Institution

Dept. of CEA, GLA Univ., Mathura, India

fYear

2013

fDate

22-23 Feb. 2013

Firstpage

800

Lastpage

805

Abstract

A large amount of information on the Web is hidden from users because traditional search engines cannot access or index it. These search engines can crawl information only by following hypertext links, and they ignore forms that require a login or any other authorization process. The hidden Web is the deep part of the Web that is not available to traditional Web crawlers, and obtaining content from it is a challenging task. Many Web sites today contain pages that are dynamic in nature, which makes it difficult for traditional Web crawlers to retrieve information from them. This paper briefly discusses earlier efforts to solve this problem and then presents a comparative study of the previously defined architectures across various parameters. Based on an analysis of these methods, a framework is proposed that uses intelligent agent technology to access the hidden Web.

Keywords

Web sites; authorisation; hypermedia; indexing; information retrieval; search engines; software agents; Web crawlers; Web sites; authorization process; data access approach; hidden Web; hypertext links; information crawling; information retrieval problem; intelligent agent technology; search engines; Crawlers; Databases; Feature extraction; Filling; Intelligent agents; Learning (artificial intelligence); Search engines; Hidden Web; Hidden Web Crawling; Hidden Web Databases; Intelligent Agent Technology;

fLanguage

English

Publisher

ieee

Conference_Titel

Advance Computing Conference (IACC), 2013 IEEE 3rd International

Conference_Location

Ghaziabad

Print_ISBN

978-1-4673-4527-9

Type

conf

DOI

10.1109/IAdCC.2013.6514329

Filename

6514329