Title :
Single-Snapshot File System Analysis
Author :
Wildani, A. ; Adams, Ian F. ; Miller, Eric L.
Author_Institution :
Storage Syst. Res. Center, Univ. of California, Santa Cruz, Santa Cruz, CA, USA
Abstract :
Metadata snapshots are a common method for gaining insight into file systems due to their small size and relative ease of acquisition. Since they are static, most researchers have used them for relatively simple analyses such as file size distributions and age of files. We hypothesize that it is possible to gain much richer insights into file system and user behavior by clustering features in metadata snapshots and comparing the entropy within clusters to the entropy within natural partitions such as directory hierarchies. We discuss several different methods for gaining deeper insights into metadata snapshots, and show a small proof of concept using data from Los Alamos National Laboratories. In our initial work, we see evidence that it is possible to identify user locality information, traditionally the purview of dynamic traces, using a single static snapshot.
Keywords :
file organisation; human factors; meta data; pattern clustering; Los Alamos National Laboratories; directory hierarchies; feature clustering; file age; file size distributions; metadata snapshots; single-snapshot file system analysis; user behavior; Analytical models; Clustering algorithms; Computational modeling; Correlation; Educational institutions; Entropy; Market research; data mining; storage; workload analysis;
Conference_Titel :
Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2013 IEEE 21st International Symposium on
Conference_Location :
San Francisco, CA
DOI :
10.1109/MASCOTS.2013.47