• DocumentCode
    951889
  • Title

    Incorporating Gene Functions into Regression Analysis of DNA-Protein Binding Data and Gene Expression Data to Construct Transcriptional Networks

  • Author

    Wei, Peng ; Pan, Wei

  • Author_Institution
    Sch. of Public Health, Minnesota Univ., Minneapolis, MN
  • Volume
    5
  • Issue
    3
  • fYear
    2008
  • Firstpage
    401
  • Lastpage
    415
  • Abstract
    Useful information on transcriptional networks has been extracted by regression analyses of gene expression data and DNA-protein binding data. However, a potential limitation of these approaches is their assumption on the common and constant activity level of a transcription factor (TF) on all of the genes in any given experimental condition, for example, any TF is assumed to be either an activator or a repressor, but not both, whereas it is known that some TFs can be dual regulators. Rather than assuming a common linear regression model for all of the genes, we propose using separate regression models for various gene groups; the genes can be grouped based on their functions or some clustering results. Furthermore, to take advantage of the hierarchical structure of many existing gene function annotation systems such as gene ontology (GO), we propose a shrinkage method that borrows information from relevant gene groups. Applications to a yeast data set and simulations lend support to our proposed methods. In particular, we find that the shrinkage method consistently works well under various scenarios. We recommend the use of the shrinkage method as a useful alternative to the existing methods.
  • Keywords
    DNA; genetics; molecular biophysics; molecular configurations; ontologies (artificial intelligence); proteins; regression analysis; DNA-protein binding; biomolecular structure; gene expression; gene function annotation systems; gene functions; gene ontology; regression analysis; shrinkage method; transcription factor; transcriptional networks; LASSO; Microarray; Shrinkage estimator; Stratified analysis; Transcription factor; Binding Sites; Computer Simulation; DNA; DNA-Binding Proteins; Gene Expression Profiling; Models, Biological; Oligonucleotide Array Sequence Analysis; Protein Binding; Regression Analysis; Signal Transduction; Transcription Factors;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2007.1062
  • Filename
    4359857