• DocumentCode
    752586
  • Title

    Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms

  • Author

    Smith, Alan Jay

  • Author_Institution
    Department of Electrical Engineering and Computer Science and the Lawrence Berkeley Laboratory, University of California
  • Issue
    4
  • fYear
    1981
  • fDate
    7/1/1981 12:00:00 AM
  • Firstpage
    403
  • Lastpage
    417
  • Abstract
    In most large computer installations files are moved between on-line disk and mass storage (tape, integrated mass storage device) either automatically by the system and/or at the direction of the user. In this paper we present and analyze long term file reference data in order to develop a basis for the construction of algorithms for file migration. Specifically, we examine the use of the on-line user (primarily text editor) data sets at the Stanford Linear Accelerator Center (SLAC) computer installation through the analysis of 13 months of file reference data. We find that most files are used very few times. Of those that are used sufficiently frequently that their reference patterns may be examined, we find that: 1) about a third show declining rates of reference during their lifetime, 2) of the remainder, very few (about 5 percent) show correlated interreference intervals, and 3) interreference intervals (in days) appear to be more skewed than would occur with the Bernoulli process. Thus, about two-thirds of all suffi1ciently active files appear to be referenced as a renewal process with a skewed interreference distribution. A large number of other file reference statistics (file lifetimes, interference distributions, moments, means, number of uses/ file, file sizes, file rates of reference, etc.) are computed and presented. Throughout, statistical tests are described and explained. The results of our analysis of file reference patterns are applied in a companion paper to the development and comparative evaluation of file migration algorithms.
  • Keywords
    File migration; mass storage; memory hierarchies; replacement algorithm; time series analysis; Algorithm design and analysis; Application software; Cache storage; Data analysis; Interference; Linear accelerators; Pattern analysis; Statistical analysis; Statistical distributions; Storage automation; File migration; mass storage; memory hierarchies; replacement algorithm; time series analysis;
  • fLanguage
    English
  • Journal_Title
    Software Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0098-5589
  • Type

    jour

  • DOI
    10.1109/TSE.1981.230843
  • Filename
    1702861