Title :
Parallel CNV detection algorithm based on cloud computing
Author :
Hong, Sang-kyun ; Yoon, Jee-hee ; Hong, Dong-wan ; Lee, Un-joo ; Bleazard, Thomas
Author_Institution :
Dept. of Comput. Eng., Hallym Univ., Chuncheon, South Korea
Abstract :
Recently, the cost of whole-genome sequencing has decreased dramatically due to the development of next-generation sequencing technology, and a huge amount of sequencing data has been generated and released by research laboratories worldwide. However, it is difficult to develop mature genome analysis software and high-performance computing resources which are available to assay genome data in real time. In this paper, we propose a parallel and robust CNV detection algorithm to run on a cloud computing environment. The proposed method, which we call CNV shape was developed using a shape-based CNV detection algorithm with Map/Reduce framework. This method finds regions above a certain length with continuously increased or decreased read coverage generated by mapping sequencing data onto the human reference genome. In order to maintain load balancing of each node in the cloud computing environment, we use a partitioning method. Also, we demonstrate the efficiency of the proposed method for CNV detection using publicly available sequencing data.
Keywords :
biology computing; cloud computing; genetics; parallel algorithms; resource allocation; Map/Reduce framework; cloud computing environment; genome analysis software; high-performance computing resources; load balancing; next-generation sequencing technology; parallel CNV detection; partitioning method; robust CNV detection; shape-based CNV detection; whole-genome sequencing; Bioinformatics; Biological cells; Cloud computing; Detection algorithms; Genomics; Humans; Shape;
Conference_Titel :
Data Mining and Intelligent Information Technology Applications (ICMiA), 2011 3rd International Conference on
Conference_Location :
Macao
Print_ISBN :
978-1-4673-0231-9