DocumentCode :
1681802
Title :
Hierarchical Storage Systems and File Formats for Web Archiving
Author :
Kawano, Hiroyuki
Author_Institution :
Dept. of Syst. Design & Eng., Nanzan Univ., Seto, Japan
fYear :
2011
Firstpage :
217
Lastpage :
220
Abstract :
Many national libraries are making efforts to crawl and store various born-digital information, there are many difficult problems of the social, legal and technical aspects. In this paper, from the view points of long-term preservation of digital contents, we focus on the the urgent task of storage system, since the size of the web archive is increasing exponentially. In order to archive monotonously increasing contents, we discuss management of storage devices and file formats in web archive systems. Firstly, we propose an architecture of hierarchical storage system based on characteristics of storage devices and file compression formats. Next, we modify the file moving algorithm by using file access frequency. We also evaluate the performance of our proposed algorithm with predicted data based on actual statistics of a web archive system.
Keywords :
Internet; file organisation; records management; Web archiving; digital contents; digital information; file compression formats; file formats; hierarchical storage systems; national libraries; storage system; Cache memory; DVD; Indexes; Random access memory; Systems engineering and theory; File Formats; File Moving Algorithm; Hierarchical Storage Systems; Storage Management; Web Archive;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems Engineering (ICSEng), 2011 21st International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4577-1078-0
Type :
conf
DOI :
10.1109/ICSEng.2011.46
Filename :
6041818
Link To Document :
بازگشت