Author :
Fedak, Gilles ; He, Haiwu ; Cappello, Franck
Abstract :
Desktop Grids use the computing, network and storage resources of idle desktop PCs distributed over multiple LANs or the Internet to compute a large variety of resource-demanding distributed applications. While these applications need to access, compute, store and circulate large volumes of data, little attention has been paid to data management in such large-scale, dynamic, heterogeneous, volatile and highly distributed Grids. In most cases, data management relies on ad-hoc solutions, and providing a general approach is still a challenging issue. To address this problem, we propose the BitDew framework, a programmable environment for automatic and transparent data management on computational Desktop Grids. This paper describes the BitDew programming interface, its architecture, and the performance evaluation of its runtime components. BitDew relies on a specific set of meta-data to drive key data management operations, namely life cycle, distribution, placement, replication and fault tolerance, with a high level of abstraction. The BitDew runtime environment is a flexible distributed service architecture that integrates modular P2P components such as DHTs for a distributed data catalog and collaborative transport protocols for data distribution. Through several examples, we describe how application programmers and BitDew users can exploit BitDew's features. The performance evaluation demonstrates that the high level of abstraction and transparency is obtained with a reasonable overhead, while offering the benefit of scalability, performance and fault tolerance with little programming cost.
Keywords :
data handling; grid computing; peer-to-peer computing; resource allocation; BitDew programming interface; Internet; LAN; P2P components; automatic data management; collaborative transport protocols; data distribution; desktop grids; distributed data catalog; flexible distributed service architecture; large-scale data management; meta-data; network; programmable environment; resource-demanding distributed applications; storage resources; transparent data management; Computer architecture; Computer networks; Distributed computing; Drives; Environmental management; Fault tolerance; Grid computing; IP networks; Large-scale systems; Runtime
Conference_Title :
High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for