Title :
LDPLFS: Improving I/O Performance without Application Modification
Author :
Wright, S.A. ; Hammond, S.D. ; Pennycook, S.J. ; Miller, I. ; Herdman, J.A. ; Jarvis, S.A.
Author_Institution :
Dept. of Comput. Sci., Univ. of Warwick, Coventry, UK
Abstract :
Input/Output (I/O) operations can represent a significant proportion of run-time when large scientific applications are run in parallel and at scale. In order to address the growing divergence between processing speeds and I/O performance, the Parallel Log-structured File System (PLFS) has been developed by EMC Corporation and the Los Alamos National Laboratory (LANL) to improve the performance of parallel file activities. Currently, PLFS requires the use of either (i) the FUSE Linux Kernel module, (ii) a modified MPI library with a customised ROMIO MPI-IO library, or (iii) an application rewrite to utilise the PLFS API directly. In this paper we present an alternative method of utilising PLFS in applications. This method employs a dynamic library to intercept the low-level POSIX operations and retarget them to use the equivalents offered by PLFS. We demonstrate our implementation of this approach, named LDPLFS, on a set of standard UNIX tools, as well on as a set of standard parallel I/O intensive mini-applications. The results demonstrate almost equivalent performance to a modified build of ROMIO and improvements over the FUSE-based approach. Furthermore, through our experiments we demonstrate decreased performance in PLFS when ran at scale on the Lustre file system.
Keywords :
Linux; application program interfaces; file organisation; message passing; operating system kernels; parallel processing; EMC Corporation; FUSE Linux kernel module; FUSE-based approach; I-O performance improvement; LANL; LDPLFS; Los Alamos National Laboratory; Lustre file system; PLFS API; ROMIO; ROMIO MPI-IO library; UNIX tools; dynamic library; input-output operations; low-level POSIX operations; parallel I-O intensive miniapplications; parallel log-structured file system; scientific applications; Bandwidth; Containers; File systems; Fuses; Libraries; Program processors; Standards; Data Storage Systems; File Systems; High Performance Computing; I/O;
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0974-5
DOI :
10.1109/IPDPSW.2012.172