• DocumentCode
    598574
  • Title

    On distributed file tree walk of parallel file systems

  • Author

    LaFon, J. ; Misra, Sudip ; Bringhurst, J.

  • Author_Institution
    New Mexico State Univ., Las Cruces, NM, USA
  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    1
  • Lastpage
    11
  • Abstract
    Supercomputers generate vast amounts of data, typically organized into large directory hierarchies on parallel file systems. While the supercomputing applications are parallel, the tools used to process them requiring complete directory traversais, are typically serial. We present an algorithm framework and three fully distributed algorithms for traversing large parallel file systems, and performing file operations in parallel. The first algorithm introduces a randomized work-stealing scheduler; the second improves the first with proximity-awareness; and the third improves upon the second by using a hybrid approach. We have tested our implementation on Cielo, a 1.37 petaflop supercomputer at the Los Alamos National Laboratory and its 7 petabyte file system. Test results show that our algorithms execute orders of magnitude faster than state-of-the-art algorithms while achieving ideal load balancing and low communication cost. We present performance insights from the use of our algorithms in production systems at LANL, performing daily file system operations.
  • Keywords
    mainframes; meta data; parallel algorithms; parallel databases; randomised algorithms; resource allocation; software performance evaluation; tree data structures; Cielo; Los Alamos National Laboratory; algorithm framework; complete directory traversals; computer speed 1.37 PFLOPS; distributed file tree; fully distributed algorithms; hybrid approach; ideal load balancing; large directory hierarchies; large parallel file systems; meta data; parallel applications; parallel file operations; petabyte file system; proximity-awareness; randomized work-stealing scheduler; supercomputers; supercomputing applications; Algorithm design and analysis; Heuristic algorithms; Parallel algorithms; Program processors; Servers; Synchronization; File Systems; Metadata; Parallel Algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    2167-4329
  • Print_ISBN
    978-1-4673-0805-2
  • Type

    conf

  • DOI
    10.1109/SC.2012.82
  • Filename
    6468454