DocumentCode :
712990
Title :
Keyword focused web crawler
Author :
Agre, Gunjan H. ; Mahajan, Nikita V.
Author_Institution :
Dept. of Comput. Sci. & Eng., G.H. Raisoni Coll. of Eng., Nagpur, India
fYear :
2015
fDate :
26-27 Feb. 2015
Firstpage :
1089
Lastpage :
1092
Abstract :
Users and uses of internet is growing tremendously these days which causing an extreme trouble and efforts at user side to get web pages searched which are as per concern and relevant to user´s requirement Generally users approach to search web pages from a large available hierarchy of concepts or use a query to browse web pages from available search engine and receive results based on search pattern where few of the results are relevant to search and most of them are not. Web crawler plays an important role in search engine and act as a key element when performance is considered. This paper includes domain engineering concept and keyword driven crawling with relevancy decision mechanism and uses Ontology concepts which ensures the best path for improving crawler´s performance. This paper introduces extraction of URLs based on keyword or search criteria. It extracts URLs for web pages which contains searched keyword in their content and considers such pages only as important and doesn´t download web pages irrelevant to search. It offers high optimality comparing with traditional web crawler and can enhance search efficiency with more accuracy.
Keywords :
Internet; Web sites; data mining; ontologies (artificial intelligence); query processing; relevance feedback; search engines; Internet; URL extraction; Web page browsing; Web query; keyword driven crawling; keyword focused Web crawler; ontology concepts; relevancy decision mechanism; search criteria; search engine; search pattern; user requirement; Algorithm design and analysis; Computers; Crawlers; Ontologies; Search engines; Uniform resource locators; Web pages; Web crawler; keyword; knowledge path; ontology; topic specific web crawler;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics and Communication Systems (ICECS), 2015 2nd International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4799-7224-1
Type :
conf
DOI :
10.1109/ECS.2015.7124749
Filename :
7124749
Link To Document :
بازگشت