Title :
CEO a cloud epistasis computing model in GWAS
Author :
Wang, Zhengkui ; Wang, Yue ; Tan, Kian-Lee ; Wong, Limsoon ; Agrawal, Divyakant
Author_Institution :
NUS Graduate School for Integrative Sciences and Engineering, National University of Singapore, Singapore
Abstract :
The 1000 Genome project has made available a large number of single nucleotide polymorphisms (SNPs) for genome-wide association studies (GWAS). However, the large number of SNPs has also rendered the discovery of epistatic interactions of SNPs computationally expensive. Parallelizing the computation offers a promising solution. In this paper, we propose a cloud-based epistasis computing (CEO) model that examines all k-locus SNPs combinations to find statistically significant epistatic interactions efficiently. Our CEO model uses the MapReduce framework which can be executed both on user´s own clusters or on a cloud environment. Our cloud-based solution offers elastic computing resources to users, and more importantly, makes our approach affordable and available to all end-users. We evaluate our CEO model on a cluster of more than 40 nodes. Our experiment results show that our CEO model is computationally flexible, scalable and practical.
Keywords :
Cloud computing; GWAS; Hadoop; MapReduce;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-8306-8
DOI :
10.1109/BIBM.2010.5706542