• DocumentCode
    2039051
  • Title

    Utilizing RNA-Seq data for cancer network inference

  • Author

    Ying Cai ; Fendler, B. ; Atwal, G.S.

  • Author_Institution
    Quantitative Biol., Cold Spring Harbor Lab., Cold Spring Harbor, TX, USA
  • fYear
    2012
  • fDate
    2-4 Dec. 2012
  • Firstpage
    46
  • Lastpage
    49
  • Abstract
    An important challenge in cancer systems biology is to uncover the complex network of interactions between genes (tumor suppressor genes and oncogenes) implicated in cancer. Next generation sequencing provides unparalleled ability to probe the expression levels of the entire set of cancer genes and their transcript isoforms. However, there are onerous statistical and computational issues in interpreting high-dimensional sequencing data and inferring the underlying genetic network. In this study, we analyzed RNA-Seq data from lymphoblastoid cell lines derived from a population of 69 human individuals and implemented a probabilistic framework to construct biologically-relevant genetic networks. In particular, we employed a graphical lasso analysis, motivated by considerations of the maximum entropy formalism, to estimate the sparse inverse covariance matrix of RNA-Seq data. Gene ontology, pathway enrichment and protein-protein path length analysis were all carried out to validate the biological context of the predicted network of interacting cancer gene isoforms.
  • Keywords
    RNA; biology computing; cancer; cellular biophysics; data analysis; genetics; genomics; maximum entropy methods; molecular biophysics; molecular configurations; probability; proteins; sequential estimation; statistical analysis; tumours; RNA-Seq data analysis; biologically-relevant genetic networks; cancer network inference; cancer systems biology; complex network interactions; computational issues; expression levels; gene ontology; graphical lasso analysis; high-dimensional sequencing data; human individuals; lymphoblastoid cell lines; maximum entropy formalism; next generation sequencing; oncogenes; onerous statistical issues; probabilistic framework; protein-protein path length analysis; sparse inverse covariance matrix estimation; transcript isoforms; tumor suppressor genes; RNA-Seq; cancer; graphical lasso; maximum entropy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics, (GENSIPS), 2012 IEEE International Workshop on
  • Conference_Location
    Washington, DC
  • ISSN
    2150-3001
  • Print_ISBN
    978-1-4673-5234-5
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2012.6507723
  • Filename
    6507723