DocumentCode
2995392
Title
A Server-Level Adaptive Data Layout Strategy for Parallel File Systems
Author
Song, Huaiming ; Jin, Hui ; He, Jun ; Sun, Xian-He ; Thakur, Rajeev
Author_Institution
R&D Center, Dawning Inf. Ind., Beijing, China
fYear
2012
fDate
21-25 May 2012
Firstpage
2095
Lastpage
2103
Abstract
Parallel file systems are widely used for providing a high degree of I/O parallelism to mask the gap between I/O and memory speed. However, peak I/O performance is rarely attained due to complex data access patterns of applications. Based on the observation that the I/O performance of small requests is often limited by the request service rate, and the performance of large requests is limited by I/O bandwidth, we take into consideration both factors and propose a server-level adaptive data layout strategy. The proposed strategy adopts different stripe sizes for different file servers according to the data access characteristics on each individual server. We let the file servers that can fully utilize bandwidth hold more data, and the file servers that are limited with request service rate hold less data. As a result, heavy-load servers can offload some data accesses to light-load servers for potential improvement of I/O performance. We present a method to measure access cost for each data block and then utilize an equal-depth histogram approach to distributed data blocks across multiple servers adaptively, so as to balance data accesses on all file servers. Analytical and experimental results demonstrate that the proposed server-level adaptive layout strategy can improve I/O performance by as much as 80.3% and is more appropriate for applications with complex data access patterns.
Keywords
bandwidth allocation; file organisation; file servers; parallel processing; I/O bandwidth; I/O parallelism; access cost measurement; bandwidth utilization; complex data access pattern; data access characteristics; distributed data block; equal-depth histogram approach; file server; heavy-load server; light-load server; parallel file system; peak I/O performance; server-level adaptive data layout strategy; server-level adaptive layout strategy; Arrays; Bandwidth; Distributed databases; Histograms; Layout; Optimization; Servers; Server-level adaptive data layout; data layout optimization; equal-depth histogram; parallel file system; variable stripe size;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
Conference_Location
Shanghai
Print_ISBN
978-1-4673-0974-5
Type
conf
DOI
10.1109/IPDPSW.2012.246
Filename
6270568
Link To Document