DocumentCode
1815176
Title
Choosing SNPs using feature selection
Author
Phuong, Tu Minh ; Lin, Zhen ; Altman, Russ B.
Author_Institution
Dept. of Inf. Technol. Post & Telecom., Inst. of Technol. Hanoi, Vietnam
fYear
2005
fDate
8-11 Aug. 2005
Firstpage
301
Lastpage
309
Abstract
A major challenge for genomewide disease association studies is the high cost of genotyping large number of single nucleotide polymorphisms (SNP). The correlations between SNPs, however, make it possible to select a parsimonious set of informative SNPst known as "tagging" SNPs, able to capture most variation in a population. Considerable research interest has recently focused on the development of methods for finding such SNPs. In this paper, we present an efficient method for finding tagging SNPs. The method does not involve computation-intensive search for SNP subsets but discards redundant SNPs using a feature selection algorithm. In contrast to most existing methods, the method presented here does not limit itself to using only correlations between SNPs in local groups. By using correlations that occur across different chromosomal regions, the method can reduce the number of globally redundant SNPs. Experimental results show that the number of tagging SNPs selected by our method is smaller than by using block-based methods.
Keywords
cellular biophysics; diseases; genetics; medical computing; block-based method; chromosomal region; computation-intensive search; feature selection algorithm; genomewide disease association; genotyping large number; globally redundant SNP; single nucleotide polymorphism; Bioinformatics; Biological cells; Diseases; Genetics; Genomics; Humans; Information technology; Principal component analysis; Size measurement; Tagging;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Systems Bioinformatics Conference, 2005. Proceedings. 2005 IEEE
Print_ISBN
0-7695-2344-7
Type
conf
DOI
10.1109/CSB.2005.22
Filename
1498031
Link To Document