• DocumentCode
    1908255
  • Title

    Partial Disk Failures: Using Software to Analyze Physical Damage

  • Author

    Huang, Hai ; Shin, Kang G.

  • Author_Institution
    IBM TJ Watson Res., Yorktown Heights
  • fYear
    2007
  • fDate
    24-27 Sept. 2007
  • Firstpage
    185
  • Lastpage
    198
  • Abstract
    A good understanding of disk failures is crucial to ensure a reliable storage of data. There have been numerous studies characterizing disk failures under the common assumption that failed disks are generally unusable. Contrary to this assumption, partial disk failures are very common, e.g., caused by a head crash resulting in a small number of inaccessible disk sectors. Nevertheless, the damage can sometimes be catastrophic if the file system meta-data were among the affected sectors. As disk density rapidly increases, the likelihood of losing data also rises. This paper describes our experience in analyzing partial disk failures using the physical locations of damaged disk sectors to assess the extent and characteristics of the damage on disk platter surfaces. Based on our findings, we propose several fault-tolerance techniques to proactively guard against permanent data loss due to partial disk failures.
  • Keywords
    disc storage; fault tolerance; storage management; data storage; file system meta-data; partial disk failures; physical damage; reliable data storage; Computer crashes; Failure analysis; Fault tolerance; File systems; Hard disks; Information retrieval; Magnetic analysis; Magnetic heads; Manufacturing; Portable computers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mass Storage Systems and Technologies, 2007. MSST 2007. 24th IEEE Conference on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-0-7695-3025-3
  • Type

    conf

  • DOI
    10.1109/MSST.2007.4367973
  • Filename
    4367973