• DocumentCode
    2864096
  • Title
    Web Archive System for Efficient Storage of Web History Information
  • Author
    Lee, Moohun ; Cho, Sunghoon ; Choi, Euiin
  • fYear
    2007
  • fDate
    11-13 Oct. 2007
  • Firstpage
    378
  • Lastpage
    381
  • Abstract
    The growth of the web has made it convenient for people to access large amounts of information, and most people now depend on the web to obtain information. Data on the web is generally updated and deleted by the web server manager, so much earlier information disappears regardless of its importance. For this reason, web archive systems have been studied as a way to efficiently manage valuable data produced over a long period of time. However, existing web archive systems do not support systematic processing and management of data as it was before an update. In addition, their storage systems are not efficient at storing large quantities of web information. The method proposed in this paper uses a special crawler to collect web history information. The WebBase crawler can reduce overhead in web page collection. It can store deleted web information using an RCS. Thus, web history information can be stored and accessed efficiently.
  • Keywords
    Books; Crawlers; History; Indexing; Information management; Pervasive computing; Search engines; Software libraries; Web pages; Web server;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Pervasive Computing, 2007. IPC. The 2007 International Conference on
  • Conference_Location
    Jeju City
  • Print_ISBN
    978-0-7695-3006-2
  • Type
    conf
  • DOI
    10.1109/IPC.2007.61
  • Filename
    4438458