Title :
Identifying Context-Specific Transcription Factor Targets From Prior Knowledge and Gene Expression Data
Author :
Fertig, Elana J. ; Favorov, Alexander V. ; Ochs, Michael F.
Author_Institution :
Dept. of Oncology, Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
Numerous methodologies, assays, and databases presently provide candidate targets of transcription factors (TFs). However, TFs rarely regulate their targets universally. The context of activation of a TF can change the transcriptional response of targets. Direct multiple regulation typical to mammalian genes complicates direct inference of TF targets from gene expression data. We present a novel statistic that infers context-specific TF regulation based upon the CoGAPS algorithm, which infers overlapping gene expression patterns resulting from coregulation. Numerical experiments with simulated data showed that this statistic correctly inferred targets that are common to multiple TFs, except in cases where the signal from a TF is negligible relative to noise level and signal from other TFs. The statistic is robust to moderate levels of error in the simulated gene sets, identifying fewer false positives than false negatives. Significantly, the regulatory statistic refines the number of TF targets relevant to cell signaling in gastrointestinal stromal tumors (GIST) to genes consistent with the phosphorylation patterns of TFs identified in previous studies. As formulated, the proposed regulatory statistic has wide applicability to inferring set membership in integrated datasets. This statistic could be naturally extended to account for prior probabilities of set membership or to add candidate gene targets.
Keywords :
bioinformatics; biomedical engineering; cellular biophysics; genetics; genomics; molecular biophysics; proteins; statistical analysis; tumours; CoGAPS algorithm; GIST; TF activation; TF phosphorylation pattern; TF target direct inference; TF target regulation; cell signaling; context-specific transcription factor target identification; coregulation; direct multiple regulation; false negative identification; false positive identification; gastrointestinal stromal tumor; gene expression data; mammalian gene; noise level; simulated gene set; statistic method; target transcriptional response; Bioinformatics; genetic expression; genomics; Algorithms; Cell Line, Tumor; Computer Simulation; Gastrointestinal Stromal Tumors; Gene Expression Profiling; Genomics; Humans; Models, Genetic; Neoplasm Proteins; Phosphorylation; Transcription Factors;
Journal_Title :
NanoBioscience, IEEE Transactions on
DOI :
10.1109/TNB.2013.2263390