• DocumentCode
    712990
  • Title

    Keyword focused web crawler

  • Author

    Agre, Gunjan H. ; Mahajan, Nikita V.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., G.H. Raisoni Coll. of Eng., Nagpur, India
  • fYear
    2015
  • fDate
    26-27 Feb. 2015
  • Firstpage
    1089
  • Lastpage
    1092
  • Abstract
    Internet use and the number of Internet users are growing rapidly, which makes it increasingly difficult for users to find web pages relevant to their requirements. Users generally either browse a large hierarchy of concepts or submit a query to a search engine, and they receive results matched on the search pattern, of which only a few are relevant and most are not. The web crawler plays an important role in a search engine and is a key element where performance is concerned. This paper combines domain engineering concepts and keyword-driven crawling with a relevancy decision mechanism, and uses ontology concepts to select the best path for improving the crawler's performance. It introduces URL extraction based on a keyword or search criteria: the crawler extracts URLs only for web pages whose content contains the searched keyword, treats only those pages as important, and does not download pages irrelevant to the search. The approach offers high optimality compared with a traditional web crawler and can improve search efficiency and accuracy.
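    The keyword-driven filtering described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the ontology and relevancy decision mechanism are not modeled, and relevance is reduced to a plain substring match. The names `PageParser`, `keyword_crawl`, `fetch`, `max_pages`, and the in-memory `PAGES` table are all hypothetical and introduced only for illustration. Note that this sketch must fetch a page to inspect its content; it saves work by never following links found on an irrelevant page.

```python
from collections import deque
from html.parser import HTMLParser


class PageParser(HTMLParser):
    """Collects text content and href targets from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        self.text_parts.append(data)


def keyword_crawl(seed, keyword, fetch, max_pages=50):
    """Breadth-first crawl that keeps only URLs whose text contains keyword.

    fetch is a hypothetical callable mapping a URL to an HTML string,
    or to None when the page is unavailable.
    """
    keyword = keyword.lower()
    queue, seen, relevant = deque([seed]), {seed}, []
    fetched = 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        html = fetch(url)
        fetched += 1
        if html is None:
            continue
        parser = PageParser()
        parser.feed(html)
        text = " ".join(parser.text_parts).lower()
        if keyword not in text:
            continue  # irrelevant page: do not record it or follow its links
        relevant.append(url)
        for link in parser.links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return relevant


# Tiny in-memory "web" (assumed data) standing in for real HTTP fetches.
PAGES = {
    "a": '<p>Ontology based crawler</p><a href="b">b</a><a href="c">c</a>',
    "b": '<p>Cooking recipes</p><a href="d">d</a>',
    "c": '<p>A focused crawler survey</p>',
    "d": '<p>crawler hidden behind an irrelevant page</p>',
}

print(keyword_crawl("a", "crawler", PAGES.get))
```

    In this toy data, page `d` is never fetched because it is reachable only through the irrelevant page `b`. A real crawler would replace `PAGES.get` with an HTTP client and resolve relative links against the page URL.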
  • Keywords
    Internet; Web sites; data mining; ontologies (artificial intelligence); query processing; relevance feedback; search engines; Internet; URL extraction; Web page browsing; Web query; keyword driven crawling; keyword focused Web crawler; ontology concepts; relevancy decision mechanism; search criteria; search engine; search pattern; user requirement; Algorithm design and analysis; Computers; Crawlers; Ontologies; Search engines; Uniform resource locators; Web pages; Web crawler; keyword; knowledge path; ontology; topic specific web crawler;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Electronics and Communication Systems (ICECS), 2015 2nd International Conference on
  • Conference_Location
    Coimbatore
  • Print_ISBN
    978-1-4799-7224-1
  • Type

    conf

  • DOI
    10.1109/ECS.2015.7124749
  • Filename
    7124749