DocumentCode :
2498071
Title :
An MM-Based Optimization Algorithm for Sparse Linear Modeling on Microarray Data Analysis
Author :
Chen, Xiaohui ; Gottardo, Raphael
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of British Columbia, Vancouver, BC, Canada
fYear :
2009
fDate :
11-13 June 2009
Firstpage :
1
Lastpage :
4
Abstract :
Sparsity is crucial for high-dimensional statistical modeling. On one hand, dimensionality reduction can reduce the variability of estimation and thus provide reliable predictive power. On the other hand, the selected sub-model can discover and emphasize the underlying dependencies, which is useful for objective interpretation. Many variable selection methods have been proposed in literatures. For a prominent example, Least Absolute Shrinkage and Selection Operator (lasso) in linear regression context has been extensively explored. This paper discusses a class of scaled mixture of Gaussian models from both a penalized likelihood and a Bayesian regression point of view. We propose an Majorize-Minimize (MM) algorithm to find the Maximum A Posteriori (MAP) estimator, where the EM algorithm can be stuck at local optimum for some members in this class. Simulation studies show the outperformance of proposed algorithm in nonstochastic design variable selection scenario. The proposed algorithm is applied to a real large-scale E.coli data set with known bona fide interactions for constructing sparse gene regulatory networks. We show that our regression networks with a properly chosen prior can perform comparably to state-of-the-art regulatory network construction algorithms.
Keywords :
Bayes methods; bioinformatics; data reduction; genomics; maximum likelihood estimation; microorganisms; optimisation; Bayesian regression approach; Gaussian models; dimensionality reduction; high dimensional statistical modeling; large scale E.coli data set; majorize-minimize based optimization algorithm; maximum a posteriori estimator; microarray data analysis; penalized likelihood approach; sparse gene regulatory networks; sparse linear modeling; variable selection methods; Algorithm design and analysis; Bayesian methods; Cancer; Data analysis; Input variables; Large-scale systems; Linear regression; Predictive models; Regulators; Statistical analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedical Engineering , 2009. ICBBE 2009. 3rd International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-2901-1
Electronic_ISBN :
978-1-4244-2902-8
Type :
conf
DOI :
10.1109/ICBBE.2009.5162334
Filename :
5162334
Link To Document :
بازگشت