Title :
USVM: Selection of SNPs in Diseases Association Study Using UMDA and SVM
Author :
Wei, Bin ; Peng, Qinke ; Li, Jing ; Kang, Xuejiao ; Li, Chenyao
Author_Institution :
Syst. Eng. Inst., Xi'an Jiaotong Univ., Xi'an, China
Abstract :
With the rapid development of high-throughput genotyping technologies, increasing attention is being paid to disease association studies, which identify DNA variations that are highly associated with a specific disease. One main challenge in such studies is finding the optimal subsets of Single Nucleotide Polymorphisms (SNPs) that are most tightly associated with a disease. Feature selection has become a necessity in many bioinformatics applications. In this paper, we propose a wrapper algorithm named USVM, which combines the Univariate Marginal Distribution Algorithm (UMDA) and the Support Vector Machine (SVM) for disease association studies. USVM not only eliminates redundant features but also addresses the problem of selecting the SVM's parameters. We use USVM to analyze a Crohn's disease (CD) dataset comprising 387 samples, each with 103 SNPs. The experimental results show that our algorithm outperforms current algorithms such as DNF, CSP, and ORF.
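The wrapper idea in the abstract can be illustrated with a minimal sketch of UMDA searching over binary SNP-selection masks. This is not the authors' implementation: the function names, population sizes, probability clamping, and the toy scoring function are all assumptions made for illustration. In a wrapper such as USVM, the fitness would presumably be the accuracy of an SVM trained on the selected SNPs; here a hypothetical stand-in score is used so that the sketch runs with the standard library only, and the SVM parameter selection mentioned in the abstract is not modeled.

```python
import random


def umda_select(n_features, fitness, pop_size=40, n_keep=20,
                generations=30, seed=0):
    """Search binary feature masks with UMDA.

    `fitness` maps a mask (tuple of 0/1) to a score; higher is better.
    All default parameter values are illustrative assumptions.
    """
    rng = random.Random(seed)
    # One independent marginal probability per feature: P(feature is selected).
    probs = [0.5] * n_features
    best_mask, best_score = None, float("-inf")

    for _ in range(generations):
        # Sample a population from the product of the univariate marginals.
        population = [
            tuple(1 if rng.random() < p else 0 for p in probs)
            for _ in range(pop_size)
        ]
        # Truncation selection: keep the highest-scoring masks.
        top = sorted(population, key=fitness, reverse=True)[:n_keep]
        top_score = fitness(top[0])
        if top_score > best_score:
            best_mask, best_score = top[0], top_score
        # Re-estimate each marginal from the selected masks; clamp the
        # probabilities so that no feature becomes permanently fixed.
        for i in range(n_features):
            freq = sum(mask[i] for mask in top) / n_keep
            probs[i] = min(0.95, max(0.05, freq))

    return best_mask, best_score


# Hypothetical stand-in for "SVM accuracy on the selected SNPs":
# features 2, 5 and 7 are treated as informative, and every selected
# feature carries a small penalty to discourage redundant ones.
INFORMATIVE = (2, 5, 7)


def toy_fitness(mask):
    return sum(mask[i] for i in INFORMATIVE) - 0.1 * sum(mask)


if __name__ == "__main__":
    mask, score = umda_select(10, toy_fitness)
    print("selected features:", [i for i, bit in enumerate(mask) if bit])
    print("score:", score)
```

To move from this toy to an actual wrapper, `toy_fitness` would be replaced by a function that trains and validates a classifier on the columns selected by the mask; the UMDA loop itself would not need to change.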
Keywords :
DNA; biology computing; diseases; genetics; medical computing; molecular biophysics; molecular configurations; support vector machines; Crohn's disease; DNA variations; SNP selection; UMDA; USVM; disease association study; feature selection; high throughput genotyping technologies; single nucleotide polymorphisms; support vector machine; univariate marginal distribution algorithm; wrapper algorithm; Bioinformatics; Cancer; DNA; Diseases; Genetics; Genomics; Laboratories; Manufacturing systems; Support vector machines; Systems engineering and theory;
Conference_Title :
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-4712-1
Electronic_ISBN :
2151-7614
DOI :
10.1109/ICBBE.2010.5514774