• DocumentCode
    34660
  • Title

    A Parallel File System with Application-Aware Data Layout Policies for Massive Remote Sensing Image Processing in Digital Earth

  • Author

    Lizhe Wang ; Yan Ma ; Zomaya, Albert Y. ; Ranjan, Rajiv ; Dan Chen

  • Author_Institution
    Inst. of Remote Sensing & Digital Earth, Beijing, China
  • Volume
    26
  • Issue
    6
  • fYear
    2015
  • fDate
    June 1 2015
  • Firstpage
    1497
  • Lastpage
    1508
  • Abstract
    Remote sensing applications in Digital Earth are overwhelmed with vast quantities of remote sensing (RS) image data. The intolerable I/O burden introduced by the massive amounts of RS data and the irregular RS data access patterns has made the traditional cluster based parallel I/O systems no longer applicable. We propose a RS data object-based parallel file system for remote sensing applications and implement it with the OrangeFS file system. It provides application-aware data layout policies, together with RS data object based data I/O interfaces, for efficient support of various data access patterns of RS applications from the server side. With the prior knowledge of the desired RS data access patterns, HPGFS could offer relevant space-filling curves to organize the sliced 3-D data bricks and distribute them over I/O servers. In this way, data layouts consistent with expected data access patterns could be created to explore data locality and achieve performance improvement. Moreover, the multi-band RS data with complex structured geographical metadata could be accessed and managed as a single data object. Through experiments on remote sensing applications with different access patterns, we have achieved performance improvement of about 30 percent for I/O and 20 percent overall.
  • Keywords
    file organisation; geophysical image processing; input-output programs; parallel processing; remote sensing; OrangeFS file system; RS data access patterns; RS image data; aware data layout policies application; data I/O interfaces; data access patterns; data layout policies; data locality; digital earth; intolerable I/O burden; massive remote sensing image processing; parallel file system; Distributed databases; Earth; File systems; Layout; Remote sensing; Sensors; Servers; Parallel file system; big data; data-intensive computing; digital earth; parallel I/O; remote sensing image processing;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2014.2322362
  • Filename
    6824837