Title :
A graph-based elastic net for variable selection and module identification for genomic data analysis
Author :
Xia, Zheng ; Zhou, Xiaobo ; Chen, Wei ; Chang, Chunqi
Author_Institution :
Dept. of Radiol., Methodist Hosp. Res. Inst., Houston, TX, USA
Abstract :
Recently a network-constraint regression model[1] is proposed to incorporate the prior biological knowledge to perform regression and variable selection. In their method, a l1-norm of the coefficients is defined to impose sparse, meanwhile a Laplacian operation on the biological graph is designed to encourage smoothness of the coefficients along the network. However the grouping effect of their Laplacian smoothness operation only exits when the two connected genes both have positive or negative effects on the response. To overcome this problem, we proposed to apply the Laplacian operation on the absolute values of the coefficients to take account of the positive and negative effects. Here, we call the presented method as graph-based elastic net (GENet) because the proposed method has similar grouping effect with elastic net(ENet)[2] except the smoothness of two coefficients are specified by the network in GENet. Further, an efficient algorithm which has same spirit with LARS [3] is developed to solve our optimization problem. Simulation studies showed that the proposed method has better performance than network-constrained regularization without absolute values. Application to Alzheimer´s disease(AD) microarray gene-expression dataset identified several subnetworks on Kyoto Encyclopedia of Genes and Genomes(KEGG) transcriptional pathways that are related to progression of AD. Many of those findings are confirmed by published literatures.
Keywords :
cellular biophysics; diseases; genomics; graph theory; medical computing; molecular biophysics; optimisation; regression analysis; sparse matrices; Alzheimer disease; Kyoto Encyclopedia of Genes and Genomes transcriptional pathways; Laplacian operation; Laplacian smoothness; genomic data analysis; graph-based elastic net; l1-norm; microarray gene expression; module identification; network-constrained regularization; network-constraint regression model; optimization; sparse; variable selection; Bioinformatics; Equations; Genomics; Laplace equations; Mathematical model; Optimization; Elastic Net; Laplacian graph; pathway;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-8306-8
Electronic_ISBN :
978-1-4244-8307-5
DOI :
10.1109/BIBM.2010.5706591