• DocumentCode
    3714
  • Title

    Biclustering with Flexible Plaid Models to Unravel Interactions between Biological Processes

  • Author

    Henriques, Rui ; Madeira, Sara C.

  • Author_Institution
    Inst. Super. Tecnico, Univ. de Lisboa, Lisbon, Portugal
  • Volume
    12
  • Issue
    4
  • fYear
    2015
  • fDate
    July-Aug. 1 2015
  • Firstpage
    738
  • Lastpage
    752
  • Abstract
    Genes can participate in multiple biological processes at a time and thus their expression can be seen as a composition of the contributions from the active processes. Biclustering under a plaid assumption allows the modeling of interactions between transcriptional modules or biclusters (subsets of genes with coherence across subsets of conditions) by assuming an additive composition of contributions in their overlapping areas. Despite the biological interest of plaid models, few biclustering algorithms consider plaid effects and, when they do, they place restrictions on the allowed types and structures of biclusters, and suffer from robustness problems by seizing exact additive matchings. We propose BiP (Biclustering using Plaid models), a biclustering algorithm with relaxations to allow expression levels to change in overlapping areas according to biologically meaningful assumptions (weighted and noise-tolerant composition of contributions). BiP can be used over existing biclustering solutions (seizing their benefits) as it is able to recover excluded areas due to unaccounted plaid effects and detect noisy areas non-explained by a plaid assumption, thus producing an explanatory model of overlapping transcriptional activity. Experiments on synthetic data support BiP´s efficiency and effectiveness. The learned models from expression data unravel meaningful and non-trivial functional interactions between biological processes associated with putative regulatory modules.
  • Keywords
    biology computing; genetics; pattern clustering; pattern matching; active processes; additive composition; additive matchings; biclustering algorithms; biological interest; biologically meaningful assumptions; expression data; flexible plaid models; gene subsets; multiple biological processes; noise-tolerant composition; noisy area detection; nontrivial functional interactions; overlapping areas; overlapping transcriptional activity; putative regulatory modules; synthetic data support; transcriptional modules; Additives; Biological processes; Biological system modeling; Computational modeling; Data models; Noise; Noise measurement; Plaid model; biclustering; biomedical data analysis;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2388206
  • Filename
    7001608