Title :
SNP sets selection under mutual information criterion, application to F7/FVII dataset
Author :
Brunel, H. ; Perera, A. ; Buil, A. ; Sabater-Lleal, M. ; Souto, J.C. ; Fontcuberta, J. ; Vallverdú, M. ; Soria, J.M. ; Caminal, P.
Author_Institution :
Institut de Bioenginyeria de Catalunya, Centre de Recerca en Enginyeria Biomÿdica, Departament d´´Enginyeria de Sistemes, AutomÃ\xa0tica i InformÃ\xa0tica industrial, Universitat Politÿcnica de Catalunya, Pau Gargallo 5, 08028 Barcelona, Spain
Abstract :
One of the main goals of human genetics is to find genetic markers related to complex diseases. In blood coagulation process, it is known that genetic variability in F7 gene is the most responsible for observed variations in FVII levels in blood. In this work, we propose a method for selecting sets of Single Nucleotide Polymorphisms (SNPs) significantly correlated with a phenotype (FVII levels). This method employs a feature selection algorithm (variant of Sequential Forward Selection, SFS) based on a criterion of statistical significance of a mutual information functional. This algorithm is applied to a sample of independent individuals from the GAIT project. Main SNPs found by the algorithm are in correspondence with previous results published using family-based techniques.
Keywords :
Blood; Cardiac disease; Cardiovascular diseases; Coagulation; Genetic communication; Genetic mutations; Information theory; Mutual information; Proteins; Statistical analysis; Algorithms; Artificial Intelligence; Cluster Analysis; Databases, Genetic; Factor VII; Genomics; Humans; Models, Genetic; Models, Statistical; Models, Theoretical; Phenotype; Polymorphism, Single Nucleotide;
Conference_Titel :
Engineering in Medicine and Biology Society, 2008. EMBS 2008. 30th Annual International Conference of the IEEE
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-1814-5
Electronic_ISBN :
1557-170X
DOI :
10.1109/IEMBS.2008.4650032