DocumentCode :
1787504
Title :
MapReduce-Based RESTMD: Enabling Large-Scale Sampling Tasks with Distributed HPC Systems
Author :
Kondikoppa, Praveenkumar ; Platania, Richard ; Seung-Jong Park ; Keyes, Tom ; Jaegil Kim ; Nayong Kim ; Joohyun Kim ; Shuju Bai
Author_Institution :
Center for Comput. & Technol., Louisiana State Univ., Baton Rouge, LA, USA
fYear :
2014
fDate :
3-5 June 2014
Firstpage :
30
Lastpage :
35
Abstract :
A novel implementation of Replica Exchange Statistical Temperature Molecular Dynamics (RESTMD), belonging to a generalized ensemble method and also known as parallel tempering, is presented. Our implementation employs the MapReduce (MR)-based iterative framework for launching RESTMD over high performance computing (HPC) clusters including our test bed system, Cyber-infrastructure for Reconfigurable Optical Networks (CRON) simulating a network-connected distributed system. Our main contribution is a new implementation of STMD plugged into the well-known CHARMM molecular dynamics package as well as the RESTMD implementation powered by the Hadoop that scales out in a cluster and across distributed systems effectively. To address challenges for the use of Hadoop MapReduce, we examined contributing factors on the performance of the proposed framework with various runtime analysis experiments with two biological systems that differ in size and over different types of HPC resources. Many advantages with the use of RESTMD suggest its effectiveness for enhanced sampling, one of grand challenges in a variety of areas of studies ranging from chemical systems to statistical inference. Lastly, with its support for scale-across capacity over distributed computing infrastructure (DCI) and the use of Hadoop for coarse-grained task-level parallelism, MapReduce-based RESTMD represents truly a good example of the next-generation of applications whose provision is increasingly becoming demanded by science gateway projects, in particular, backed by IaaS clouds.
Keywords :
data handling; inference mechanisms; iterative methods; parallel processing; statistical analysis; CHARMM molecular dynamics package; CRON; DCI; Hadoop MapReduce; MapReduce based iterative framework; MapReduce-based RESTMD; STMD; cyber-infrastructure for reconfigurable optical networks; distributed HPC systems; distributed computing infrastructure; generalized ensemble method; high performance computing clusters; large-scale sampling tasks; network-connected distributed system; parallel tempering; replica exchange statistical temperature molecular dynamics; statistical inference; Biological system modeling; Computational modeling; Distributed computing; Logic gates; Parallel processing; Scalability; Temperature distribution; Distributed; MapReduce; RESTMD;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Science Gateways (IWSG), 2014 6th International Workshop on
Conference_Location :
Dublin
Type :
conf
DOI :
10.1109/IWSG.2014.12
Filename :
6882065
Link To Document :
بازگشت