Title :
Dynamic file striping and data layout transformation on parallel system with fluctuating I/O workload
Author :
Seung Woo Son ; Sehrish, Saba ; Wei-keng Liao ; Oldfield, R. ; Choudhary, Alok
Abstract :
As the number of compute cores on modern parallel machines grows beyond hundreds of thousands, scalable and consistent I/O performance is becoming hard to obtain because file system performance fluctuates. This fluctuation is often caused by RAID disk rebuilds after hardware failures or by concurrent jobs competing for I/O. We present a mechanism that stripes files across a dynamically selected subset of the most lightly loaded I/O servers to achieve the best I/O bandwidth available from the system. We implement this mechanism in an I/O software layer that enables memory-to-file data layout transformation and allows transparent file partitioning. File partitioning is a technique that divides data among a set of files and manages file access so that the data appear as a single file to users. Experimental results on NERSC's Hopper indicate that our approach effectively isolates I/O variation on shared systems and improves overall I/O performance significantly.
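The two ideas in the abstract, choosing the most lightly loaded I/O servers and presenting several subfiles as one logical file, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the per-server load values, and the round-robin stripe mapping are all assumptions made for illustration, and a real system would obtain load figures from the file system rather than from a dictionary.

```python
import heapq


def select_lightest_servers(loads, k):
    """Return the ids of the k servers with the smallest load, lightest first.

    `loads` is a hypothetical mapping of server id -> current load estimate.
    """
    return heapq.nsmallest(k, loads, key=loads.get)


def locate(offset, stripe_size, servers):
    """Map a logical byte offset to (server id, offset in that server's subfile).

    Stripes are assigned round-robin over the selected servers (an assumption),
    so the set of subfiles can be read back as one logical file.
    """
    stripe_index, within = divmod(offset, stripe_size)
    server = servers[stripe_index % len(servers)]
    local_offset = (stripe_index // len(servers)) * stripe_size + within
    return server, local_offset


if __name__ == "__main__":
    # Hypothetical load estimates for four I/O servers.
    loads = {0: 0.9, 1: 0.2, 2: 0.5, 3: 0.1}
    chosen = select_lightest_servers(loads, 2)
    print(chosen)
    print(locate(9, 4, chosen))
```

With the sample loads shown, servers 3 and 1 are selected, and logical byte 9 (with a 4-byte stripe) falls in the second stripe held by server 3.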
Keywords :
RAID; disc drives; file organisation; input-output programs; parallel machines; I/O performance; I/O servers; I/O software layer; I/O variation; NERSC's Hopper; RAID disk; compute cores; concurrent jobs; dynamic file striping; file access; fluctuating I/O workload; fluctuating file system performance; hardware failures; memory-to-file data layout transformation; parallel system; transparent file partitioning; Aggregates; Arrays; Bandwidth; Bismuth; Layout; Libraries; Servers; Collective I/O; File partitioning; Parallel NetCDF
Conference_Title :
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location :
Indianapolis, IN
DOI :
10.1109/CLUSTER.2013.6702689