• DocumentCode
    572908
  • Title

    Cloud-GSQCT:a parallel approach to screen gene sequences for phylogenetics analysis

  • Author

    Meng, Zhen ; Xiao, Xiao ; Li, Jianhui ; Zhou, Yuanchun ; Cao, Wei ; Shen, Geng

  • Author_Institution
    Sci. Data Center, Comput. Network Inf. Center, Beijing, China
  • fYear
    2012
  • fDate
    24-26 Aug. 2012
  • Firstpage
    660
  • Lastpage
    663
  • Abstract
    Screening data for phylogenetic analysis from large datasets is a known computational problem of data-intensive application. In this paper, we implement a parallel approach, Cloud-GSQCT (Cloud Gene Sequence Quality Control Tool), to screen gene sequence data for phylogenetic analysis, using the MapReduce paradigm to parallelize the solution and to manage its execution. The parallel approach using Hadoop are implemented and the evaluation of the approach is also presented. For download: http://www.darwintree.cn/tools.htm.
  • Keywords
    biology computing; cloud computing; genetics; parallel processing; quality control; Cloud-GSQCT; Hadoop; MapReduce paradigm; cloud gene sequence quality control tool; computational problem; data-intensive application; execution management; gene sequence data screening; gene sequences screening; parallel approach; phylogenetics analysis; Biology; Databases; Hardware; High definition video; Data screening; GSQCT; Hadoop; MapReduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Processing (CSIP), 2012 International Conference on
  • Conference_Location
    Xi´an, Shaanxi
  • Print_ISBN
    978-1-4673-1410-7
  • Type

    conf

  • DOI
    10.1109/CSIP.2012.6308940
  • Filename
    6308940