Title of article :
Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data
Author/Authors :
Lee, JungJun Department of Statistics - Korea University - Seoul, Republic of Korea , Kim, SungHwan Department of Applied Statistics - Konkuk University - Seoul, Republic of Korea , Jhong, Jae-Hwan Department of Statistics - Korea University - Seoul, Republic of Korea , Koo, Ja-Yong Department of Statistics - Korea University - Seoul, Republic of Korea
Abstract :
In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due
to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model
(JMCCM) that elucidates conditional dependent structures of genes with controlling for potential genotype perturbations. To
this end, the modifed Cholesky decomposition is utilized to parametrize entries of a precision matrix. Te JMCCM maximizes
the likelihood function to estimate parameters involved in the model. We also develop a variable selection algorithm that selects
explanatory variables and Cholesky factors by exploiting the combination of the GCV and BIC as benchmarks, together with Rao
and Wald statistics. Importantly, we notice that sparse estimation of a precision matrix (or equivalently gene network) is efectively
achieved via the proposed variable selection scheme and contributes to exploring signifcant hub genes shown to be concordant
to a priori biological evidence. In simulation studies, we confrm that our model selection efciently identifes the true underlying
networks. With an application to miRNA and SNPs data from yeast (a.k.a. eQTL data), we demonstrate that constructed gene
networks reproduce validated biological and clinical knowledge with regard to various pathways including the cell cycle pathway.
Keywords :
eQTL , Estimation , Covariance , GCV
Journal title :
Computational and Mathematical Methods in Medicine