Title :
Long Term Management of Web Cache for Web Archive
Author :
Ozawa, Ryunosuke ; Uehara, Minoru
Author_Institution :
Dept. of Inf. & Comput. Sci., Toyo Univ., Kawagoe, Japan
Abstract :
Today, the Internet is often used as a large database. However, a Web page can be modified or deleted by its author at any time, and viewers have no control over this. It is therefore difficult for viewers to vouch for the contents of Web pages referenced by official documents. In this paper, we propose a system that guarantees Web page existence. The system checks whether the contents of a page are the same as they were when the page was accessed at a given time. It does not need to collect all pages; it guarantees only pages that have been accessed at least once. In this system, the cache server's log is continuously monitored, and cached pages are copied from the cache to the archive. Furthermore, a user can search for pages with a query that specifies a time and retrieve the contents created or modified at that time.
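The time-based lookup described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `Snapshot`, `Archive`, `store`, `lookup`, and `unchanged_since` are hypothetical, the archive is held in memory, SHA-256 is assumed as the content fingerprint, and snapshots for a given URL are assumed to arrive in timestamp order (as they would when read from a cache log).

```python
import bisect
import hashlib
from dataclasses import dataclass, field


@dataclass
class Snapshot:
    timestamp: float  # when the cached copy was observed (seconds since epoch)
    digest: str       # SHA-256 hex digest of the page body (assumed fingerprint)
    body: bytes


@dataclass
class Archive:
    # Maps each URL to its snapshots, kept in ascending timestamp order.
    snapshots: dict = field(default_factory=dict)

    def store(self, url, timestamp, body):
        """Copy a cached page into the archive unless its content is unchanged.

        Assumes calls for one URL arrive in timestamp order.
        Returns True if a new version was stored.
        """
        digest = hashlib.sha256(body).hexdigest()
        versions = self.snapshots.setdefault(url, [])
        if versions and versions[-1].digest == digest:
            return False  # same content as the latest archived version
        versions.append(Snapshot(timestamp, digest, body))
        return True

    def lookup(self, url, when):
        """Return the version of `url` that was current at time `when`, or None."""
        versions = self.snapshots.get(url, [])
        times = [v.timestamp for v in versions]
        i = bisect.bisect_right(times, when)
        return versions[i - 1] if i else None

    def unchanged_since(self, url, when, body):
        """Check whether `body` matches the version archived as of `when`."""
        snap = self.lookup(url, when)
        return snap is not None and snap.digest == hashlib.sha256(body).hexdigest()
```

Under these assumptions, a process watching the cache server log would call `store` for each logged access, and a reader checking a citation would call `lookup` or `unchanged_since` with the time at which the page was referenced.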
Keywords :
Internet; Web sites; cache storage; content management; document handling; information retrieval systems; query processing; Web archive; Web cache; Web page content; Web page existence; cache server log; large databases; long term management; official documents; query; Databases; Engines; Search engines; Servers; Web pages
Conference_Title :
Network-Based Information Systems (NBiS), 2012 15th International Conference on
Conference_Location :
Melbourne, VIC
Print_ISBN :
978-1-4673-2331-4
DOI :
10.1109/NBiS.2012.62