DocumentCode
559694
Title
Parallel CNV detection algorithm based on cloud computing
Author
Hong, Sang-kyun ; Yoon, Jee-hee ; Hong, Dong-wan ; Lee, Un-joo ; Bleazard, Thomas
Author_Institution
Dept. of Comput. Eng., Hallym Univ., Chuncheon, South Korea
fYear
2011
fDate
24-26 Oct. 2011
Firstpage
286
Lastpage
289
Abstract
Recently, the cost of whole-genome sequencing has decreased dramatically due to the development of next-generation sequencing technology, and a huge amount of sequencing data has been generated and released by research laboratories worldwide. However, it is difficult to develop mature genome analysis software and high-performance computing resources which are available to assay genome data in real time. In this paper, we propose a parallel and robust CNV detection algorithm to run on a cloud computing environment. The proposed method, which we call CNV shape was developed using a shape-based CNV detection algorithm with Map/Reduce framework. This method finds regions above a certain length with continuously increased or decreased read coverage generated by mapping sequencing data onto the human reference genome. In order to maintain load balancing of each node in the cloud computing environment, we use a partitioning method. Also, we demonstrate the efficiency of the proposed method for CNV detection using publicly available sequencing data.
Keywords
biology computing; cloud computing; genetics; parallel algorithms; resource allocation; Map/Reduce framework; cloud computing environment; genome analysis software; high-performance computing resources; load balancing; next-generation sequencing technology; parallel CNV detection; partitioning method; robust CNV detection; shape-based CNV detection; whole-genome sequencing; Bioinformatics; Biological cells; Cloud computing; Detection algorithms; Genomics; Humans; Shape;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining and Intelligent Information Technology Applications (ICMiA), 2011 3rd International Conference on
Conference_Location
Macao
Print_ISBN
978-1-4673-0231-9
Type
conf
Filename
6108445
Link To Document