• DocumentCode
    464292
  • Title

    Inferring Regulatory Interactions between Transcriptional Factors and Genes by Propagating Known Regulatory Links

  • Author

    Zhong, Qian ; Boscolo, Riccardo ; Gardner, Timothy S. ; Roychowdhury, Vwani P.

  • Author_Institution
    Dept. of Electr. Eng., California Univ., Los Angeles, CA
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    205
  • Lastpage
    211
  • Abstract
    Determining transcriptional regulatory networks has been one of the most important goals in the field of functional genomics. Despite the recent advances in experimental techniques, complementary computational techniques have lagged behind. We introduce a novel computational methodology that uses DNA microarray data and known regulatory interactions to predict unknown regulatory interactions. Our method involves three steps: in the training stage, we utilize network component analysis (NCA) (Liao et al., 2003; Kao et al., 2004; Boscolo et al., 2005) to reconstruct the hidden activity profiles of transcriptional factors (TF); then we cluster TFs into functional modules according to the similarities of their reconstructed activity profiles; in the prediction stage, we infer additional TF-gene regulatory links by selecting TF profiles that best interpret genes expression profiles via a linear model. We applied the methodology to a gene expression dataset of bacterium Escherichia coli, whose partial TF-gene regulatory structure is obtained from RegulonDB (Salgado et al., 2004). Cross-validation results show that when the profiles of all TFs regulating a gene are reconstructed from NCA, we could identify 36% of the TF-gene interactions, and the prediction accuracy is 89%. And when the profiles of partial (50% or more) TFs regulating a gene can be reconstructed, we can identify 14% of the TF-gene interactions, and the accuracy rate is 69%. These represent some of the best known accuracy and coverage statistics reported in the literature so far
  • Keywords
    DNA; biology computing; genetics; DNA microarray data; Escherichia coli; functional genomics; gene expression dataset; network component analysis; regulatory interactions; regulatory links; transcriptional factors; transcriptional regulatory networks; Bioinformatics; Biomedical computing; Computational biology; Computational intelligence; DNA computing; Genetic expression; Genomics; Predictive models; Statistics; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0710-9
  • Type

    conf

  • DOI
    10.1109/CIBCB.2007.4221225
  • Filename
    4221225