DocumentCode :
1767126
Title :
Practical distributed computation of maximal exact matches in the cloud
Author :
El-Din, Sondos Seif ; Aboelhoda, Mohamed
Author_Institution :
Center for Inf. Sci., Nile Univ., Giza, Egypt
fYear :
2014
fDate :
1-4 June 2014
Firstpage :
609
Lastpage :
613
Abstract :
Computation of maximal exact matches (MEMs) is an important problem in comparing genomic sequences. Optimal sequential algorithms for computing MEMs have been already introduced and integrated in a number of software tools. To cope with large data and exploit new computing paradigms like cloud computing, it is important to develop efficient and ready-to-use solutions running on distributed parallel architecture. In a previous work, we have introduced a distributed algorithm running on a computer cluster for computing the MEMs. In this paper, we extend this work in two directions: First, we introduce new variants of this algorithm; one of them has a better time complexity than the published one. These variants as we will demonstrate by experiments are faster in practice. Second, we introduce a cloud based implementation, where we automate the process of creating and configuring the cluster, submitting the jobs, and finally collecting the results and terminating the cloud machines.
Keywords :
biology computing; cloud computing; computational complexity; distributed algorithms; genomics; MEM; cloud computing; cloud machines; computer cluster; distributed algorithm; distributed computation; genomic sequences; maximal exact matches; optimal sequential algorithms; time complexity; Arrays; Bioinformatics; Clustering algorithms; Computers; Genomics; Time complexity; Bioinformatics; Cloud Computing; Computer Cluster; String Processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Biomedical and Health Informatics (BHI), 2014 IEEE-EMBS International Conference on
Conference_Location :
Valencia
Type :
conf
DOI :
10.1109/BHI.2014.6864438
Filename :
6864438
Link To Document :
بازگشت