DocumentCode :
683825
Title :
Building localized bioinformatics platform based on Galaxy and high performance computing cluster
Author :
Xiao-Lei Wang ; Jiang-yu Li ; Yang Liu ; Yu-feng Wang ; Dong-sheng Zhao
Author_Institution :
Inst. of Health Service & Med. Inf., Acad. of Mil. Med. Sci., Beijing, China
fYear :
2013
fDate :
16-18 Dec. 2013
Firstpage :
712
Lastpage :
716
Abstract :
With the rapid development of high-throughput sequencing technology, biomedical research has entered into the era of big data. It causes problems about storage and analysis of massive biological data which need to be solved by high-performance computing. Therefore, we build the localized high-performance one-stop data analysis platform to provide convenient and efficient computational analysis services for biomedical researchers. We deploy Galaxy and integrate software tools and datasets into Galaxy in computing cluster, build stable web service, FTP service and management database in order to optimize and improve the performance of Galaxy, and use distributed resource management application interface to collaborate Galaxy with Sun Grid Engine for automatically scheduling and assigning computing resources. Currently the platform has been put into trial operation. The peak performance is 10 Teraflops and the capacity of storage is 40TB. The platform provides many functions such as sequence alignment, short sequence mapping, gene annotation, transcriptome analysis, metagenomic analysis and phylogenetic analysis, and approximately 700GB reference databases including human genome, viruses, bacteria, fungi, etc.
Keywords :
Big Data; Web services; bioinformatics; data analysis; molecular biophysics; parallel processing; resource allocation; scheduling; FTP service; Galaxy; Sun Grid Engine; Web service; bacteria database; big data; biomedical research; computational analysis services; data storage; distributed resource management application; file transfer protocol; fungi database; gene annotation; high performance computing cluster; high-throughput sequencing technology; human genome database; localized bioinformatics platform; localized high-performance one-stop data analysis platform; management database; metagenomic analysis; phylogenetic analysis; resource scheduling; sequence alignment; short sequence mapping; software tools; transcriptome analysis; virus database; Bioinformatics; Data analysis; Genomics; Microorganisms; Software; Visual databases; Bioinformatics; Galaxy; High-performance Computing; Localized; Online analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Biomedical Engineering and Informatics (BMEI), 2013 6th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-2760-9
Type :
conf
DOI :
10.1109/BMEI.2013.6747031
Filename :
6747031
Link To Document :
بازگشت