DocumentCode :
2450539
Title :
Unibus: Aspects of heterogeneity and fault tolerance in cloud computing
Author :
Slawinska, Magdalena ; Slawinski, Jaroslaw ; Sunderam, Vaidy
Author_Institution :
Dept. of Math & Comput. Sci., Emory Univ., Atlanta, GA, USA
fYear :
2010
fDate :
19-23 April 2010
Firstpage :
1
Lastpage :
10
Abstract :
The paper describes our ongoing project, termed Unibus, in the context of facilitating fault-tolerant execution of MPI applications on computing chunks in the cloud. In general, Unibus focuses on resource access virtualization and automatic, user-transparent resource provisioning, which simplify the use of heterogeneous resources available to users. In this work, we present the key Unibus concepts (the Capability Model, composite operations, mediators, soft and successive conditioning, and metaapplications) and demonstrate how to employ Unibus to orchestrate resources provided by a commercial cloud provider into a fault-tolerant platform capable of executing message-passing applications. To support fault tolerance, we use DMTCP (Distributed MultiThreaded CheckPointing), which enables checkpointing at the user's level. To demonstrate that the Unibus-created, FT-enabled platform can execute MPI applications, we ran the NAS Parallel Benchmarks and measured the overhead introduced by FT.
Keywords :
Internet; application program interfaces; checkpointing; fault tolerant computing; MPI applications; NAS parallel benchmarks; Unibus; capability model; cloud computing; composite operations; distributed multithreaded checkpointing; fault tolerance; heterogeneity; metaapplications; Application software; Authentication; Checkpointing; Cloud computing; Computer science; Drives; Fault tolerance; Resource virtualization; Software libraries; Web services; cloud computing; fault tolerance; heterogeneity; resource access virtualization; unified access;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel & Distributed Processing, Workshops and PhD Forum (IPDPSW), 2010 IEEE International Symposium on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-6533-0
Type :
conf
DOI :
10.1109/IPDPSW.2010.5470876
Filename :
5470876