• DocumentCode
    2939535
  • Title

    An Open-source Collaboration Environment for Metagenomics Research

  • Author

    Su, Xiaoquan ; Ma, Yongzheng ; Yang, Hongwei ; Chang, Xingzhi ; Nan, Kai ; Xu, Jian ; Ning, Kang

  • Author_Institution
    Qingdao Inst. of Bioenergy & Bioprocess Technol., Qingdao, China
  • fYear
    2011
  • fDate
    5-8 Dec. 2011
  • Firstpage
    7
  • Lastpage
    14
  • Abstract
    By analyzing metagenomic data from microbial communities, the taxonomical and functional component of hundreds of previously unknown microbial communities have been elucidated in the past few years. However, metagenomic data analyses are both data- and computation-intensive, which require extensive computational power. Most of the current metagenomic data analysis software were designed to be used on a single PC (Personal Computer), which could not match with the fast increasing number of large metagenomic projects´ computational requirements. Therefore, advanced computational environment has to be developed to cope with such needs. In this paper, we proposed an open-source collaboration environment for metagenomic data analysis, which enabled the parallel analysis of multiple metagenomic datasets at the same time. By using this collaboration environment, researchers from different locations could submit their data, collaboratively configure the analysis pipeline, and perform data analysis efficiently. As of now, more than 30 metagenomic data analysis projects have already been conducted based on this environment.
  • Keywords
    biology computing; data analysis; genomics; groupware; microorganisms; parallel processing; public domain software; advanced computational environment; computational requirements; functional component; large metagenomic projects; metagenomic data analyses; metagenomic data analysis software; metagenomics research; multiple metagenomic datasets; open-source collaboration environment; parallel analysis; taxonomical component; unknown microbial communities; Bandwidth; Collaboration; Communities; Data analysis; Pipelines; Servers; Software; collaboration environment; data- and computation-intensive computing; metagenomics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    E-Science (e-Science), 2011 IEEE 7th International Conference on
  • Conference_Location
    Stockholm
  • Print_ISBN
    978-1-4577-2163-2
  • Type

    conf

  • DOI
    10.1109/eScience.2011.10
  • Filename
    6123550