• DocumentCode
    2995392
  • Title
    A Server-Level Adaptive Data Layout Strategy for Parallel File Systems
  • Author
    Song, Huaiming ; Jin, Hui ; He, Jun ; Sun, Xian-He ; Thakur, Rajeev
  • Author_Institution
    R&D Center, Dawning Inf. Ind., Beijing, China
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    2095
  • Lastpage
    2103
  • Abstract
    Parallel file systems are widely used for providing a high degree of I/O parallelism to mask the gap between I/O and memory speed. However, peak I/O performance is rarely attained due to complex data access patterns of applications. Based on the observation that the I/O performance of small requests is often limited by the request service rate, and the performance of large requests is limited by I/O bandwidth, we take into consideration both factors and propose a server-level adaptive data layout strategy. The proposed strategy adopts different stripe sizes for different file servers according to the data access characteristics on each individual server. We let the file servers that can fully utilize bandwidth hold more data, and the file servers that are limited by request service rate hold less data. As a result, heavy-load servers can offload some data accesses to light-load servers for potential improvement of I/O performance. We present a method to measure access cost for each data block and then utilize an equal-depth histogram approach to distribute data blocks across multiple servers adaptively, so as to balance data accesses on all file servers. Analytical and experimental results demonstrate that the proposed server-level adaptive layout strategy can improve I/O performance by as much as 80.3% and is more appropriate for applications with complex data access patterns.
  • Keywords
    bandwidth allocation; file organisation; file servers; parallel processing; I/O bandwidth; I/O parallelism; access cost measurement; bandwidth utilization; complex data access pattern; data access characteristics; distributed data block; equal-depth histogram approach; file server; heavy-load server; light-load server; parallel file system; peak I/O performance; server-level adaptive data layout strategy; server-level adaptive layout strategy; Arrays; Bandwidth; Distributed databases; Histograms; Layout; Optimization; Servers; Server-level adaptive data layout; data layout optimization; equal-depth histogram; parallel file system; variable stripe size;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0974-5
  • Type
    conf
  • DOI
    10.1109/IPDPSW.2012.246
  • Filename
    6270568