DocumentCode
752586
Title
Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms
Author
Smith, Alan Jay
Author_Institution
Department of Electrical Engineering and Computer Science and the Lawrence Berkeley Laboratory, University of California
Issue
4
fYear
1981
fDate
7/1/1981 12:00:00 AM
Firstpage
403
Lastpage
417
Abstract
In most large computer installations files are moved between on-line disk and mass storage (tape, integrated mass storage device) either automatically by the system and/or at the direction of the user. In this paper we present and analyze long term file reference data in order to develop a basis for the construction of algorithms for file migration. Specifically, we examine the use of the on-line user (primarily text editor) data sets at the Stanford Linear Accelerator Center (SLAC) computer installation through the analysis of 13 months of file reference data. We find that most files are used very few times. Of those that are used sufficiently frequently that their reference patterns may be examined, we find that: 1) about a third show declining rates of reference during their lifetime, 2) of the remainder, very few (about 5 percent) show correlated interreference intervals, and 3) interreference intervals (in days) appear to be more skewed than would occur with the Bernoulli process. Thus, about two-thirds of all suffi1ciently active files appear to be referenced as a renewal process with a skewed interreference distribution. A large number of other file reference statistics (file lifetimes, interference distributions, moments, means, number of uses/ file, file sizes, file rates of reference, etc.) are computed and presented. Throughout, statistical tests are described and explained. The results of our analysis of file reference patterns are applied in a companion paper to the development and comparative evaluation of file migration algorithms.
Keywords
File migration; mass storage; memory hierarchies; replacement algorithm; time series analysis; Algorithm design and analysis; Application software; Cache storage; Data analysis; Interference; Linear accelerators; Pattern analysis; Statistical analysis; Statistical distributions; Storage automation; File migration; mass storage; memory hierarchies; replacement algorithm; time series analysis;
fLanguage
English
Journal_Title
Software Engineering, IEEE Transactions on
Publisher
ieee
ISSN
0098-5589
Type
jour
DOI
10.1109/TSE.1981.230843
Filename
1702861
Link To Document