DocumentCode :
1516545
Title :
A Novel Method to Select Informative SNPs and Their Application in Genetic Association Studies
Author :
Liao, Bo ; Li, Xiong ; Zhu, Wen ; Cao, Zhi
Author_Institution :
Coll. of Inf. Sci. & Eng., Hunan Univ., Changsha, China
Volume :
9
Issue :
5
fYear :
2012
Firstpage :
1529
Lastpage :
1534
Abstract :
The association studies between complex diseases and single nucleotide polymorphisms (SNPs) or haplotypes have recently received great attention. However, these studies are limited by the cost of genotyping all SNPs. Therefore, it is essential to find a small subset of tag SNPs representing the rest of the SNPs. The presence of linkage disequilibrium between tag SNPs and the disease variant (genotyped or not), may allow fine mapping study. In this paper, we combine a nearest-means classifier (NMC) and ant colony algorithm to select tags. Results show that our method (ACO/NMC) can get a similar prediction accuracy with method BPSO/SVM and is better than BPSO/STAMPA for small data sets. For large data sets, although the prediction accuracy of our method is lower than BPSO/SVM, ACO/ NMC can reach a high accuracy (>;99 percent) in a relatively short time. when the number of tags increases, the time complexity of NMC is nearly linear growth. To find out that the ability of tags to locate disease locus, we simulate a case-control study and use two-locus haplotype analysis to quantitatively assess the power. The result showed that 20 percent of all SNPs selected by NMC have about 10 percent higher power than random tags, on average.
Keywords :
biology computing; diseases; genetics; molecular biophysics; pattern classification; polymorphism; support vector machines; BPSO-STAMPA; BPSO-SVM; ant colony algorithm; case-control study; complex diseases; genetic association; genotyped disease variant; linkage disequilibrium; nearest-means classifier; single nucleotide polymorphisms; two-locus haplotype analysis; Accuracy; Bioinformatics; Diseases; Genomics; Prediction algorithms; Support vector machines; Haplotypes; informative SNP; single nucleotide polymorphism; tag selection.; Algorithms; Genetic Association Studies; Linkage Disequilibrium; Polymorphism, Single Nucleotide; Support Vector Machines;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2012.70
Filename :
6200260
Link To Document :
بازگشت