DocumentCode :
2345026
Title :
Cloud-MAQ: The cloud-enabled scalable whole genome reference Assembly application
Author :
Talukder, Asoke K. ; Gandham, Santhosh ; Prahalad, H.A. ; Bhattacharyya, Nitai Pada
Author_Institution :
Geschickten Solutions, Bangalore, India
fYear :
2010
fDate :
6-8 Sept. 2010
Firstpage :
1
Lastpage :
5
Abstract :
Problems in biology are in general NP-hard and demand tremendous resources in terms of both time and computing power. Most computing systems developed for quantifying biological objects suffer from such limitations. MAQ (Mapping and Assembly with Quality) is one such popular bioinformatics system, developed for whole genome reference assembly. It is designed to handle the challenges of short sequence reads generated by Illumina sequencing machines and supports a maximum read length of 63 nucleotides. MAQ is neither multithreaded nor many-core ready: it runs on a single CPU. Therefore, as the data size increases, it fails to scale efficiently and requires a supercomputer to perform the assembly within a desired time. In this paper we report Cloud-MAQ, which uses the cloud computing paradigm to address the NP-hard-related challenges of whole genome reference assembly. Through Hadoop and the cloud paradigm, MAQ is made parallel and scalable. MAQ functionality has also been enhanced to support recent Illumina reads of 76 nucleotides. Cloud-MAQ increases the performance of MAQ reference assembly multi-fold.
Keywords :
Internet; bioinformatics; computational complexity; genomics; optimisation; parallel processing; CPU; Cloud-MAQ; Hadoop; Illumina sequencing machine; NP-hard problem; bioinformatic system; cloud computing paradigm; cloud enabled scalable whole genome reference assembly; nucleotide; Assembly; Cloud computing; Clouds; Software; Computational Quantitative Biology; Whole Genome Reference Assembly
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Wireless And Optical Communications Networks (WOCN), 2010 Seventh International Conference On
Conference_Location :
Colombo
Print_ISBN :
978-1-4244-7203-1
Type :
conf
DOI :
10.1109/WOCN.2010.5587308
Filename :
5587308