• DocumentCode
    625600
  • Title

    Pattern-Direct and Layout-Aware Replication Scheme for Parallel I/O Systems

  • Author

    Yanlong Yin ; Jibing Li ; Jun He ; Xian-He Sun ; Thakur, Rahul

  • Author_Institution
    Comput. Sci. Dept., Illinois Inst. of Technol., Chicago, IL, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    345
  • Lastpage
    356
  • Abstract
    The performance gap between computing power and the I/O system is ever increasing, and in the meantime more and more High Performance Computing (HPC) applications are becoming data intensive. This study describes an I/O data replication scheme, named Pattern-Direct and Layout-Aware (PDLA) data replication scheme, to alleviate this performance gap. The basic idea of PDLA is replicating identified data access pattern, and saving these reorganized replications with optimized data layouts based on access cost analysis. A runtime system is designed and developed to integrate the PDLA replication scheme and existing parallel I/O system; a prototype of PDLA is implemented under the MPICH2 and PVFS2 environments. Experimental results show that PDLA is effective in improving data access performance of parallel I/O systems.
  • Keywords
    input-output programs; parallel processing; HPC; PDLA data replication scheme; access cost analysis; data access pattern; high performance computing; optimized data layouts; parallel I/O systems; pattern-direct and layout-aware data replication scheme; Computational modeling; Data models; Layout; Optimization; Prototypes; Runtime; System analysis and design; I/O optimization; Parallel I/O; data access pattern; data reorganization; data replication;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
  • Conference_Location
    Boston, MA
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-6066-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2013.114
  • Filename
    6569824