DocumentCode
429473
Title
Finding the best ridge regression subset by genetic algorithms: applications to multilocus quantitative trait mapping
Author
Zhang, Bin ; Horvath, Steve
Author_Institution
Dept. of Human Genetics, California Univ., Los Angeles, CA, USA
Volume
1
fYear
2004
fDate
1-5 Sept. 2004
Firstpage
2793
Lastpage
2796
Abstract
Genetic algorithms (GAs) are increasingly used in large and complex optimization problems. Here we use GAs to optimize fitness functions related to ridge regression, which is a classical statistical procedure for dealing with a large number of features in a multivariable, linear regression setting. The algorithm avoids overfitting, gracefully handles collinearity, and leads to easily interpretable results. We use the method to model the relationship between a quantitative trait and genetic markers in a mouse cross involving 69 F2 mice. The approach will be useful in the context of many genomic data sets where the number of features far exceeds the number of observations and where features can be highly correlated.
Keywords
biology computing; genetic algorithms; genetics; regression analysis; F2 mice; fitness functions; genetic algorithms; genetic markers; genomic data sets; linear regression setting; multilocus quantitative trait mapping; multivariable setting; optimization; ridge regression subset; statistical procedure; Accuracy; Bioinformatics; Genetic algorithms; Genomics; Humans; Least squares approximation; Linear regression; Mice; Predictive models; Stochastic processes;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society, 2004. IEMBS '04. 26th Annual International Conference of the IEEE
Conference_Location
San Francisco, CA
Print_ISBN
0-7803-8439-3
Type
conf
DOI
10.1109/IEMBS.2004.1403798
Filename
1403798
Link To Document