DocumentCode
572908
Title
Cloud-GSQCT: a parallel approach to screen gene sequences for phylogenetics analysis
Author
Meng, Zhen ; Xiao, Xiao ; Li, Jianhui ; Zhou, Yuanchun ; Cao, Wei ; Shen, Geng
Author_Institution
Sci. Data Center, Comput. Network Inf. Center, Beijing, China
fYear
2012
fDate
24-26 Aug. 2012
Firstpage
660
Lastpage
663
Abstract
Screening data for phylogenetic analysis from large datasets is a known computational problem in data-intensive applications. In this paper, we implement a parallel approach, Cloud-GSQCT (Cloud Gene Sequence Quality Control Tool), to screen gene sequence data for phylogenetic analysis, using the MapReduce paradigm to parallelize the solution and to manage its execution. The approach is implemented on Hadoop, and an evaluation of the approach is also presented. The tool is available for download at http://www.darwintree.cn/tools.htm.
Keywords
biology computing; cloud computing; genetics; parallel processing; quality control; Cloud-GSQCT; Hadoop; MapReduce paradigm; cloud gene sequence quality control tool; computational problem; data-intensive application; execution management; gene sequence data screening; gene sequences screening; parallel approach; phylogenetics analysis; Biology; Databases; Hardware; High definition video; Data screening; GSQCT; MapReduce
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Science and Information Processing (CSIP), 2012 International Conference on
Conference_Location
Xi'an, Shaanxi
Print_ISBN
978-1-4673-1410-7
Type
conf
DOI
10.1109/CSIP.2012.6308940
Filename
6308940