DocumentCode :
1249584
Title :
Analysis of repair algorithms for mirrored-disk systems
Author :
Kari, Hannu H. ; Saikkonen, Heikki K. ; Park, Nohpill ; Lombardi, Fabrizio
Author_Institution :
Nokia Telecommun., Helsinki, Finland
Volume :
46
Issue :
2
fYear :
1997
fDate :
6/1/1997 12:00:00 AM
Firstpage :
193
Lastpage :
200
Abstract :
This paper analyzes the effects of several disk-repair algorithms (DRA) for a mirrored disk subsystem (RAID-1). The main interest is in disk faults and how the repair-process copies data, for user requests, `from a fault-free disk to a spare disk´ with the least performance-degradation. This study compares how various DRA affect system performance. Two DRA are compared and two access patterns (uniform and nonuniform) are studied to establish their effects on the repair process and performance. Sector faults are repaired using the reassign block facility in the SCSI protocol. When the `mean load of the disk subsystem is moderate´ and the `sector repair time is of the same order of magnitude as the mean disk request processing time´, then the differences between various DRA are minor. Simulation results indicate that the performance degradation of user disk requests can be reduced by introducing a short delay in the repair algorithm, A new algorithm (DRA 3) for detecting sector faults is presented. It scans the disk space, while no user disk-requests are issued, and using the advanced statistics of SCSI disks detects deteriorated media. Its advantage is that it can repair the disk subsystem before data are actually lost due to a media defect
Keywords :
magnetic storage; reliability; storage management; RAID-1; SCSI protocol; data copying; deteriorated media detection; disk faults; disk subsystem repair; disk-repair algorithms; fault tolerance; fault-free disk; least performance-degradation; mass memory; mean disk request processing time; mirrored-disk systems; nonuniform access patterns; reassign block facility; sector faults detection; sector repair time; spare disk; uniform access patterns; Access protocols; Algorithm design and analysis; Central Processing Unit; Degradation; Delay; Error correction codes; Fault tolerant systems; Performance analysis; Statistics; System performance;
fLanguage :
English
Journal_Title :
Reliability, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9529
Type :
jour
DOI :
10.1109/24.589946
Filename :
589946
Link To Document :
بازگشت