Title :
DDGrid: A Grid Computing Environment with Massive Concurrency and Fault-Tolerance Support
Author :
Wang, Yongjian ; Luan, Zhongzhi ; Qian, Depei ; Huang, Yuanqiang ; Chen, Ting ; Han, Biao ; Ren, Yinan ; Yu, Kunqian ; Jiang, Hualiang
Author_Institution :
Sino-German Joint Software Inst., Beihang Univ., Beijing
Abstract :
Grid Computing is an effective computing paradigm widely used in solving complex problems. There are a variety of existing grid middleware systems which support operation of grid infrastructures, including CNGrid GOS, EGEE gLite, Globus Toolkit, and OSG Condor etc. These grid infrastructures focus on encapsulating underlying computing and storage resources and providing necessary basic services such as batch job service, information service, scheduling service, and cross-domain security, etc. Some other features such as fault-tolerance, massive concurrency support are vital to the success of real applications, especially complex and long running applications. These features have not been the focus point of the current grid systems. DDGrid, a key project supported by CNGrid (China National Grid), is aiming at establishing a grid computing environment that can utilize computing resources scattered over the Internet to carry out virtual-screening operations which requires computing power that a single institute or company can´t afford. In our design and implementation of DDGrid, we propose a master/worker mode which effectively utilizes computing resources that the underlying grid infrastructure provides and tries to provide additional features of fault-tolerance and massive concurrency support that are essential to the real applications.
Keywords :
concurrency control; grid computing; middleware; scheduling; software fault tolerance; CNGrid GOS; China National Grid; DDGrid; EGEE gLite; Globus Toolkit; OSG Condor; batch job service; cross domain security; current grid systems; fault tolerance; fault-tolerance support; grid computing environment; grid infrastructure; grid middleware systems; information service; massive concurrency support; scheduling service; Concurrent computing; Fault tolerance; Grid computing; Information security; Internet; Middleware; National security; Processor scheduling; Scattering; Secure storage; Drug Discovery; Fault-tolerance; Massive Concurrency support; Master/Worker;
Conference_Titel :
Grid and Cooperative Computing, 2008. GCC '08. Seventh International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-0-7695-3449-7
DOI :
10.1109/GCC.2008.27