DocumentCode :
166702
Title :
A scalable framework for large-scale analysis of gene-gene interactions
Author :
Ben Haj Hmida, Moez ; Slimani, Yahya
Author_Institution :
Nat. Eng. Sch. of Tunis, Tunis, Tunisia
fYear :
2014
fDate :
22-26 Sept. 2014
Firstpage :
380
Lastpage :
384
Abstract :
Genome-wide association studies (GWAS) have been designed to assess associations of millions of single nucleotide polymorphisms (SNPs) to find genetic variations associated with a particular disease. The huge amount of data produced by GWAS implies a great challenge for data analysis, particularly for combinatorial methods such as Multifactor Dimensionality Reduction (MDR). The MDR approach aims to reduce the dimensionality of multi-locus genotype space to facilitate the identification of genegene interactions. This method can be computationally intensive, especially when more than hundreds polymorphisms need to be evaluated. Such a kind of problems cannot be solved in a reasonable amount of time with conventional computers. Grid computing can address computational problems, like GWAS, enabling software applications to take advantage of widespread computational resources that are managed by diverse organizations in a secure and reliable way. We are, therefore, motivated to extend the MDR software to support distributed execution on available grid resources. In this paper we propose a framework for supporting parallel execution of the MDR method on Grid environments. This framework helps biologists to perform large-scale epistatic interaction in GWAS. Given a large number of SNPs, our framework distributes SNPs combinations over computing nodes to scale-up to large datasets. The experiment results demonstrate that our framework is capable of analyzing 50,000 SNPs datasets.
Keywords :
biology computing; diseases; genetics; genomics; grid computing; MDR software; disease; gene-gene interaction; genetic variation; genome-wide association studies; grid computing; large-scale analysis; large-scale epistatic interaction; multifactor dimensionality reduction; multilocus genotype space; scalable framework; single nucleotide polymorphism; Bioinformatics; Computational modeling; Dispatching; Genetics; Scalability; Software;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing (CLUSTER), 2014 IEEE International Conference on
Conference_Location :
Madrid
Type :
conf
DOI :
10.1109/CLUSTER.2014.6968785
Filename :
6968785
Link To Document :
بازگشت