DocumentCode :
3145766
Title :
DPDNS Keynote
Author :
Cappello, Franck
Author_Institution :
INRIA, Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2011
fDate :
16-20 May 2011
Firstpage :
1487
Lastpage :
1487
Abstract :
Summary form only given. In this talk, we will explore some recent results concern ing the execution of MPI applications on unstable environments. We will show that by extracting the fundamental characteristics of HPC application, we can design new fault tolerance approaches surpassing existing approaches. In particular, we will present a characterization of HPC applications and the design of a new family of fault tolerance protocols mixing the benefit of coordinated checkpointing and message logging protocols.
Keywords :
cloud computing; fault tolerant computing; message passing; MPI applications; cloud environment; coordinated checkpointing; exascale environment; fault tolerance protocols; high performance computing; hostile environments; message logging; unstable environments;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location :
Shanghai
ISSN :
1530-2075
Print_ISBN :
978-1-61284-425-1
Type :
conf
DOI :
10.1109/IPDPS.2011.410
Filename :
6009005
Link To Document :
بازگشت