• DocumentCode
    625596
  • Title

    A Transparent Collective I/O Implementation

  • Author

    Yongen Yu ; Jingjin Wu ; Zhiling Lan ; Rudd, Douglas H. ; Gnedin, Nickolay Y. ; Kravtsov, Anatoly

  • Author_Institution
    Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    297
  • Lastpage
    307
  • Abstract
    I/O performance is vital for most HPC applications especially those that generate a vast amount of data with the growth of scale. Many studies have shown that scientific applications tend to issue small and noncontiguous accesses in an interleaving fashion, causing different processes to access overlapping regions. In such scenario, collective I/O is a widely used optimization technique. However, the use of collective I/O deployed in existing MPI implementations is not trivial and sometimes even impossible. Collective I/O is an optimization based on a single collective I/O access. If the data reside in different places (e.g. in different arrays), the application has to maintain a buffer to first combine these data and then perform I/O operations on the buffer rather than the original data pieces. The process is very tedious for application developers. Besides, collective I/O requires the creating of a file view to describe the noncontiguous access patterns and additional coding is needed. Moreover, for the applications with complex data access using dynamic data sizes, it is hard or even impossible to use the file view mechanism to describe the access pattern through derived data types. In this study, we develop a user-level library called transparent collective I/O (TCIO) for application developers to easily incorporate collective I/O optimization into their applications. Preliminary experiments by means of a synthetic benchmark and a real cosmology application demonstrate that the library can significantly reduce the programming efforts required for application developers. Moreover, TCIO delivers better performance at large scales as compared to the existing collective functionality provided by MPI-IO.
  • Keywords
    application program interfaces; optimising compilers; parallel programming; software libraries; HPC application; MPI-IO; TCIO; application development; complex data access; dynamic data sizes; file view mechanism; noncontiguous access patterns; programming efforts; real cosmology application; single collective I/O access; transparent collective I/O optimization; user-level library; Arrays; File systems; Layout; Libraries; Optimization; Programming; Synchronization; Collective I/O; HPC; I/O intensive applications; MPI; One-sided communication; Parallel I/O; Transparent Collective I/O;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
  • Conference_Location
    Boston, MA
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-6066-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2013.36
  • Filename
    6569820