• DocumentCode
    2531356
  • Title

    Inference of Gene Pathways Using Gaussian Mixture Models

  • Author

    Ko, Younhee ; Zhai, ChengXiang ; Rodriguez-Zas, Sandra L.

  • Author_Institution
    Biol. Univ. of Illinois, Urbana
  • fYear
    2007
  • fDate
    2-4 Nov. 2007
  • Firstpage
    362
  • Lastpage
    367
  • Abstract
    Identification of gene-gene interactions and complete characterization of gene pathways are critical in understanding the transcript processes underlying biological processes. Bayesian network is a powerful framework to infer gene pathways. We developed a novel Bayesian network, in which we use Gaussian mixture models to describe continuous gene expression data and learn gene pathways. Mixture parameters were estimated using an EM algorithm, while the optimal number of mixture component for each gene node and the optimal network topology best supported by the data were identified using the Bayesian Information criterion (BIC). We applied the proposed approach to a histone pathway in yeast and to a less explored circadian rhythm pathway in honeybee. The performance of the proposed approach was compared against alternative Bayesian network algorithms that either discretize the gene expression information or use single distribution instead of mixtures. Evaluation shows that our approach outperforms other approaches in terms of more accurate inference of the known network and can effectively predict gene pathways with different topology using continuous data. In addition, the estimated mixture model can facilitate an intuitive description of the gene node behavior, thus enhancing the interpretation of the inferred network.
  • Keywords
    Bayes methods; Gaussian processes; biochemistry; biology computing; cellular biophysics; circadian rhythms; expectation-maximisation algorithm; genetics; inference mechanisms; molecular biophysics; Bayesian information criterion; Bayesian network; EM algorithm; Gaussian mixture models; circadian rhythm; gene expression; gene pathways; gene-gene interactions; histone pathway; honeybee; inference; optimal network topology; transcript processes; yeast; Bayesian methods; Bioinformatics; Biological processes; Biological system modeling; Fungi; Gene expression; Inference algorithms; Network topology; Parameter estimation; Solid modeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine, 2007. BIBM 2007. IEEE International Conference on
  • Conference_Location
    Fremont, CA
  • Print_ISBN
    978-0-7695-3031-4
  • Type

    conf

  • DOI
    10.1109/BIBM.2007.59
  • Filename
    4413078