DocumentCode
453801
Title
MPI-Mitten: Enabling Migration Technology in MPI
Author
Du, Cong ; Sun, Xian-He
Author_Institution
Illinois Institute of Technology, USA
Volume
1
fYear
2006
fDate
16-19 May 2006
Firstpage
11
Lastpage
18
Abstract
Group communications are commonly used in parallel and distributed environment. However, existing migration mechanisms do not support group communications. This weakness prevents migrationbased proactive fault tolerance, among others, to be applied to MPI applications. In this study, we propose distributed migration protocols with group membership management to support process migration with group changing. We design and implement a process migration enabling MPI library, named MPIMitten, to verify the protocols and enhance current MPI platforms for reliability and usability. MPI-Mitten is based on MPI standard and can be applied to any MPI-2 implementations. Experimental results show the proposed distributed process migration protocols are solid and the MPI-Mitten system is effective and is uniquely supporting migration-based fault tolerance.
Keywords
Application software; Computer science; Fault tolerance; Hardware; Libraries; Parallel processing; Protocols; Sun; Supercomputers; Usability;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing and the Grid, 2006. CCGRID 06. Sixth IEEE International Symposium on
Conference_Location
Singapore
Print_ISBN
0-7695-2585-7
Type
conf
DOI
10.1109/CCGRID.2006.71
Filename
1630790
Link To Document