Title : 
CEO a cloud epistasis computing model in GWAS
         
        
            Author : 
Wang, Zhengkui ; Wang, Yue ; Tan, Kian-Lee ; Wong, Limsoon ; Agrawal, Divyakant
         
        
            Author_Institution : 
NUS Graduate School for Integrative Sciences and Engineering, National University of Singapore, Singapore
         
        
        
        
        
        
            Abstract : 
The 1000 Genome project has made available a large number of single nucleotide polymorphisms (SNPs) for genome-wide association studies (GWAS). However, the large number of SNPs has also rendered the discovery of epistatic interactions of SNPs computationally expensive. Parallelizing the computation offers a promising solution. In this paper, we propose a cloud-based epistasis computing (CEO) model that examines all k-locus SNPs combinations to find statistically significant epistatic interactions efficiently. Our CEO model uses the MapReduce framework which can be executed both on user´s own clusters or on a cloud environment. Our cloud-based solution offers elastic computing resources to users, and more importantly, makes our approach affordable and available to all end-users. We evaluate our CEO model on a cluster of more than 40 nodes. Our experiment results show that our CEO model is computationally flexible, scalable and practical.
         
        
            Keywords : 
Cloud computing; GWAS; Hadoop; MapReduce;
         
        
        
        
            Conference_Titel : 
Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
         
        
            Conference_Location : 
Hong Kong
         
        
            Print_ISBN : 
978-1-4244-8306-8
         
        
        
            DOI : 
10.1109/BIBM.2010.5706542