• DocumentCode
    3239134
  • Title

    DDGrid: A Grid Computing Environment with Massive Concurrency and Fault-Tolerance Support

  • Author

    Wang, Yongjian ; Luan, Zhongzhi ; Qian, Depei ; Huang, Yuanqiang ; Chen, Ting ; Han, Biao ; Ren, Yinan ; Yu, Kunqian ; Jiang, Hualiang

  • Author_Institution
    Sino-German Joint Software Inst., Beihang Univ., Beijing
  • fYear
    2008
  • fDate
    24-26 Oct. 2008
  • Firstpage
    5
  • Lastpage
    14
  • Abstract
    Grid Computing is an effective computing paradigm widely used in solving complex problems. There are a variety of existing grid middleware systems which support operation of grid infrastructures, including CNGrid GOS, EGEE gLite, Globus Toolkit, and OSG Condor etc. These grid infrastructures focus on encapsulating underlying computing and storage resources and providing necessary basic services such as batch job service, information service, scheduling service, and cross-domain security, etc. Some other features such as fault-tolerance, massive concurrency support are vital to the success of real applications, especially complex and long running applications. These features have not been the focus point of the current grid systems. DDGrid, a key project supported by CNGrid (China National Grid), is aiming at establishing a grid computing environment that can utilize computing resources scattered over the Internet to carry out virtual-screening operations which requires computing power that a single institute or company can´t afford. In our design and implementation of DDGrid, we propose a master/worker mode which effectively utilizes computing resources that the underlying grid infrastructure provides and tries to provide additional features of fault-tolerance and massive concurrency support that are essential to the real applications.
  • Keywords
    concurrency control; grid computing; middleware; scheduling; software fault tolerance; CNGrid GOS; China National Grid; DDGrid; EGEE gLite; Globus Toolkit; OSG Condor; batch job service; cross domain security; current grid systems; fault tolerance; fault-tolerance support; grid computing environment; grid infrastructure; grid middleware systems; information service; massive concurrency support; scheduling service; Concurrent computing; Fault tolerance; Grid computing; Information security; Internet; Middleware; National security; Processor scheduling; Scattering; Secure storage; Drug Discovery; Fault-tolerance; Massive Concurrency support; Master/Worker;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Grid and Cooperative Computing, 2008. GCC '08. Seventh International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-0-7695-3449-7
  • Type

    conf

  • DOI
    10.1109/GCC.2008.27
  • Filename
    4662836