• DocumentCode
    262252
  • Title

    Automating Deployment of Customized Scientific Data Analytic Environments on Clouds

  • Author

    Chao Jin ; Wenjun Wu ; Hui Zhang

  • Author_Institution
    State Key Lab. of Software Dev. Environ., Beihang Univ. Beijing, Beijing, China
  • fYear
    2014
  • fDate
    3-5 Dec. 2014
  • Firstpage
    41
  • Lastpage
    48
  • Abstract
    Cloud computing has become a widely used solution for efficiently provisioning computational and storage resources. Meanwhile, it is essential to provide customizable scientific data analytic platforms for researchers to conduct their personalized data intensive analysis. The integration of scientific data analytics and Cloud computing has the potential to improve resource utilization and facilitate the development of scientific researches. This paper proposes an automatic deployment framework for deploying computing environments on Clouds for every customized scientific data analytics. To achieve customization and deployment functionalities, this framework has two major components: customization service and workspace deployment service. Users are allowed to customize their personalized scientific data analytics and required Cloud resources under the customization service. A workspace language is defined in the workspace deployment service to describe the requirements of computing resources and software tools of a scientific data analytics. Workspace deployment service then adopts Chef to deploy corresponding computing environments on Clouds based on the workspace descriptions. We also implement a system based on this automatic deployment framework and present a RNA-seq analysis use case to demonstrate how this framework and its system can be used in practice.
  • Keywords
    cloud computing; data analysis; RNA-seq analysis; automatic deployment framework; cloud computing; customization service; customized scientific data analytic environments; personalized data intensive analysis; resource utilization; software tools; workspace deployment service; Cloud computing; Communities; Data analysis; Pipelines; Software tools; XML; Automatic deployment; Cloud computing; Customization; Scientific data analytics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data and Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on
  • Conference_Location
    Sydney, NSW
  • Type

    conf

  • DOI
    10.1109/BDCloud.2014.22
  • Filename
    7034764