• DocumentCode
    691520
  • Title

    Architecture Design of Subject-Oriented Web Crawler

  • Author

    Cao Xin ; Zhang Yong ; Zhang Fuyan ; Ni Changbao

  • Author_Institution
    Comput. Sci. & Tech. Dept., Dalian Neusoft Univ. of Inf., Dalian, China
  • fYear
    2013
  • fDate
    6-7 Nov. 2013
  • Firstpage
    174
  • Lastpage
    177
  • Abstract
    In response to the defects of traditional search engines of which it will return a large amount of information when users search keywords, and it´s hard to get the useful information, thus propose the thought of subject division, that is the subject-oriented search engine.Subject crawler is the key and unique components of the subject-based search engine. The structure of the crawler has a significant impact on the speed of Web resources, as well as the multi-machine distributed extended functionality. This paper studies and designed an architecture of subject crawler, which has flexible modular scalability and multi-machine distributed scalability, and elaborated it.
  • Keywords
    Internet; architectural CAD; information retrieval; search engines; Web resources; architecture design; flexible modular scalability; keyword search; multimachine distributed extended functionality; multimachine distributed scalability; subject-oriented Web crawler; subject-oriented search engines; Computer architecture; Crawlers; HTML; Instruction sets; Memory; Search engines; Sockets; Astronomical Image; Image Segmentation; Mutual Information; PCNN;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Engineering Applications, 2013 Fourth International Conference on
  • Conference_Location
    Zhangjiajie
  • Print_ISBN
    978-1-4799-2791-3
  • Type

    conf

  • DOI
    10.1109/ISDEA.2013.444
  • Filename
    6843421