Title :
Entropy-Based Epistasy Search in SNP Case-Control Studies
Author :
Manzour, Amir ; Saraee, Mohammad
Author_Institution :
Isfahan Univ. of Technol., Esfahan
Abstract :
The purpose of gene mapping is to identify the causal genetic regions of a specific phenotype mainly a complex disease. Most complex diseases are believed to have multiple contributing loci often having subtle patterns which make them fairly difficult to find in large datasets. We present and discuss a new criterion called conditional mutual information for association mapping and compare it to the previous criterion which is mutual information from different aspects. Furthermore, algorithms are proposed to find relevance chains. The proposed algorithms are especially in favor of diseases having almost equally contributing regions known as being epistatic. These algorithms are applied to both simulated and real data. The real data represents the genotype-phenotype values for AMD disease. Proposed relevance-chain algorithms have detected some highly associated markers with AMD. C# source files for relevance-chains algorithm are freely available at https://www. sharemation. com/amanzour.
Keywords :
biology computing; data analysis; diseases; entropy; genetics; search problems; AMD disease; SNP case-control study; association mapping; causal genetic regions; complex diseases; conditional mutual information; entropy-based epistasy search; gene mapping; genotype-phenotype values; multiple contributing loci; relevance-chain algorithms; Biology; Computational modeling; Data mining; Diseases; Entropy; Fuzzy systems; Genetic communication; Information theory; Mutual information; Uncertainty;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
DOI :
10.1109/FSKD.2007.272