Title :
File system measurements and their application to the design of efficient operation logging algorithms
Author_Institution :
Div. of Comput. Sci., California Univ., Berkeley, CA, USA
fDate :
30 Sep-2 Oct 1991
Abstract :
File system operation in a transparently fault-tolerant system that uses checkpointing and message logging is discussed. Logging messages to disk is one of the primary performance costs of such systems. The author has measured the file system operations performed on large timesharing systems running Unix in terms of the level of concurrency (number of consecutive operations that do not change the state of the file system). By performing much of the data analysis online within a modified Unix kernel, statistics were collected over a long period of time with a substantial variation in system load. Using this data, it is demonstrated that a technique called null logging can reduce the number of messages logged to disk by a factor of 10 to 25, depending on the workload. This reduces the overhead of the fault-tolerance mechanism and allows a large fraction of file system operations to commit instantaneously
Keywords :
fault tolerant computing; file organisation; time-sharing systems; Unix; checkpointing; concurrency; file system measurements; message logging; null logging; operation logging algorithms; performance costs; timesharing systems; transparently fault-tolerant system; Algorithm design and analysis; Application software; Checkpointing; Computer science; Concurrent computing; Costs; Fault tolerance; Fault tolerant systems; File systems; Kernel;
Conference_Titel :
Reliable Distributed Systems, 1991. Proceedings., Tenth Symposium on
Conference_Location :
Pisa
Print_ISBN :
0-8186-2260-1
DOI :
10.1109/RELDIS.1991.145400