• DocumentCode
    228583
  • Title

    URL ordering based performance evaluation of Web crawler

  • Author

    Shoaib, Mohammed ; Maurya, Ajay Kumar

  • Author_Institution
    Fac. of Comput. Sci. & Eng., Shri Ramswaroop Memorial Univ., Lucknow, India
  • fYear
    2014
  • fDate
    1-2 Aug. 2014
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    There are billions of Web pages on World Wide Web which can be accessed via internet. All of us rely on usage of internet for source of information. This source of information is available on web in various forms such as Websites, databases, images, sound, videos and many more. The search results given by search engine are classified on basis of many techniques such as keyword matches, link analysis, or many other techniques. Search engines provide information gathered from their own indexed databases. These indexed databases contain downloaded information from web pages. Whenever a query is provided by user, the information is fetched from these indexed pages. The Web Crawler is used to download and store web pages. Web crawler of these search engines is expert in crawling various Web pages to gather huge source of information. Web Crawler is developed which orders URLs on the basis of their content similarity to a query and structural similarity. Results are provided over five parameters: Top URLs, Precision, Content, Structural and Total Similarity for a keyword.
  • Keywords
    Web sites; database indexing; query processing; search engines; Internet; URL ordering; Web crawler; Web pages; Web sites; World Wide Web; content similarity; database indexing; information source; keyword matches; link analysis; performance evaluation; search engines; structural similarity; top URLs; total similarity; Cloud computing; Distributed databases; Medical services; Patient monitoring; Schedules; URL Ordering; Web Crawler; Web Pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Engineering and Technology Research (ICAETR), 2014 International Conference on
  • Conference_Location
    Unnao
  • ISSN
    2347-9337
  • Type

    conf

  • DOI
    10.1109/ICAETR.2014.7012962
  • Filename
    7012962