Title :
Biclustering with Flexible Plaid Models to Unravel Interactions between Biological Processes
Author :
Henriques, Rui ; Madeira, Sara C.
Author_Institution :
Inst. Super. Tecnico, Univ. de Lisboa, Lisbon, Portugal
Abstract :
Genes can participate in multiple biological processes at a time and thus their expression can be seen as a composition of the contributions from the active processes. Biclustering under a plaid assumption allows the modeling of interactions between transcriptional modules or biclusters (subsets of genes with coherence across subsets of conditions) by assuming an additive composition of contributions in their overlapping areas. Despite the biological interest of plaid models, few biclustering algorithms consider plaid effects and, when they do, they place restrictions on the allowed types and structures of biclusters, and suffer from robustness problems by seizing exact additive matchings. We propose BiP (Biclustering using Plaid models), a biclustering algorithm with relaxations to allow expression levels to change in overlapping areas according to biologically meaningful assumptions (weighted and noise-tolerant composition of contributions). BiP can be used over existing biclustering solutions (seizing their benefits) as it is able to recover excluded areas due to unaccounted plaid effects and detect noisy areas non-explained by a plaid assumption, thus producing an explanatory model of overlapping transcriptional activity. Experiments on synthetic data support BiP´s efficiency and effectiveness. The learned models from expression data unravel meaningful and non-trivial functional interactions between biological processes associated with putative regulatory modules.
Keywords :
biology computing; genetics; pattern clustering; pattern matching; active processes; additive composition; additive matchings; biclustering algorithms; biological interest; biologically meaningful assumptions; expression data; flexible plaid models; gene subsets; multiple biological processes; noise-tolerant composition; noisy area detection; nontrivial functional interactions; overlapping areas; overlapping transcriptional activity; putative regulatory modules; synthetic data support; transcriptional modules; Additives; Biological processes; Biological system modeling; Computational modeling; Data models; Noise; Noise measurement; Plaid model; biclustering; biomedical data analysis;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2014.2388206