DocumentCode :
1392860
Title :
Scalable learning of large networks
Author :
Roy, Sandip ; Plis, S. ; Werner-Washburne, M. ; Lane, T.
Author_Institution :
Dept. of Comput. Sci., Univ. of New Mexico, Albuquerque, NM, USA
Volume :
3
Issue :
5
fYear :
2009
fDate :
9/1/2009 12:00:00 AM
Firstpage :
404
Lastpage :
413
Abstract :
Cellular networks inferred from condition-specific microarray data can capture the functional rewiring of cells in response to different environmental conditions. Unfortunately, many algorithms for inferring cellular networks do not scale to whole-genome data with thousands of variables. We propose a novel approach for scalable learning of large networks: cluster and infer networks (CIN). CIN learns network structures in two steps: (a) partition variables into smaller clusters, and (b) learn networks per cluster. We optionally revisit the cluster assignment of variables with poor neighbourhoods. Results on networks with known topologies suggest that CIN has substantial speed benefits, without substantial performance loss. We applied our approach to microarray compendia of glucose-starved yeast cells. The inferred networks had significantly higher number of subgraphs representing meaningful biological dependencies than random graphs. Analysis of subgraphs identified biological processes that agreed well with existing information about yeast populations under glucose starvation, and also implicated novel pathways that were previously not known to be associated with these populations.
Keywords :
bioinformatics; cellular biophysics; complex networks; genetics; inference mechanisms; learning (artificial intelligence); cellular networks; cluster and infer networks; condition-specific microarray data; glucose-starved yeast cells; large networks; random graphs; scalable learning; subgraphs; whole-genome data;
fLanguage :
English
Journal_Title :
Systems Biology, IET
Publisher :
iet
ISSN :
1751-8849
Type :
jour
DOI :
10.1049/iet-syb.2008.0161
Filename :
5243217
Link To Document :
بازگشت