DocumentCode :
34660
Title :
A Parallel File System with Application-Aware Data Layout Policies for Massive Remote Sensing Image Processing in Digital Earth
Author :
Lizhe Wang ; Yan Ma ; Zomaya, Albert Y. ; Ranjan, Rajiv ; Dan Chen
Author_Institution :
Inst. of Remote Sensing & Digital Earth, Beijing, China
Volume :
26
Issue :
6
fYear :
2015
fDate :
June 1 2015
Firstpage :
1497
Lastpage :
1508
Abstract :
Remote sensing applications in Digital Earth are overwhelmed with vast quantities of remote sensing (RS) image data. The intolerable I/O burden introduced by the massive amounts of RS data and the irregular RS data access patterns has made the traditional cluster based parallel I/O systems no longer applicable. We propose a RS data object-based parallel file system for remote sensing applications and implement it with the OrangeFS file system. It provides application-aware data layout policies, together with RS data object based data I/O interfaces, for efficient support of various data access patterns of RS applications from the server side. With the prior knowledge of the desired RS data access patterns, HPGFS could offer relevant space-filling curves to organize the sliced 3-D data bricks and distribute them over I/O servers. In this way, data layouts consistent with expected data access patterns could be created to explore data locality and achieve performance improvement. Moreover, the multi-band RS data with complex structured geographical metadata could be accessed and managed as a single data object. Through experiments on remote sensing applications with different access patterns, we have achieved performance improvement of about 30 percent for I/O and 20 percent overall.
Keywords :
file organisation; geophysical image processing; input-output programs; parallel processing; remote sensing; OrangeFS file system; RS data access patterns; RS image data; aware data layout policies application; data I/O interfaces; data access patterns; data layout policies; data locality; digital earth; intolerable I/O burden; massive remote sensing image processing; parallel file system; Distributed databases; Earth; File systems; Layout; Remote sensing; Sensors; Servers; Parallel file system; big data; data-intensive computing; digital earth; parallel I/O; remote sensing image processing;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2014.2322362
Filename :
6824837
Link To Document :
بازگشت