Title :
Integration of multiple data sources for identifying functional modules using Bayesian network
Author :
Jinlian Wang ; Hongyan Yuan ; Tadesse, Mahlet G. ; Ressom, Habtom W.
Author_Institution :
Lombardi Comprehensive Cancer Center, Georgetown Univ., Washington, DC, USA
Abstract :
Data integration is a crucial step in cancer related bioinformatics studies. Bayesian Networks (BNs) is one of the most commonly used methods for integration of multiple data sources. In this paper, we present a modified BN model that can capture and integrate heterogeneous data to increase its predictive performance. The model allows us to infer aberrant networks with scale-free and small world properties, and to group molecules into functional modules or pathways based on the primary function and biological features. Application of this method to gene and protein biomarkers of hepatocellular carcinoma (HCC) led to identification of modules that significantly contribute to HCC development and progression. The modules include cell cycle dysregulation, increased angiogenesis (e.g., vascular endothelial growth factor, blood vessel morphogenesis), oxidative metabolic alterations, and aberrant activation of signaling pathways involved in cellular proliferation, survival and differentiation (e.g., Wnt pathways). The central findings and conclusions derived from our modified BN model are consistent with those previously reported results.
Keywords :
belief networks; biochemistry; bioinformatics; blood vessels; cancer; cellular biophysics; data integration; genetics; molecular biophysics; physiological models; proteins; small-world networks; Bayesian network; HCC development; HCC progression; aberrant network; angiogenesis; bioinformatics study; biological features; blood vessel morphogenesis; cancer; cell cycle dysregulation; cellular differentiation; cellular proliferation; cellular survival; functional module identification; functional modules; functional pathways; gene biomarkers; hepatocellular carcinoma; modified BN model; molecule grouping; multiple data source integration; oxidative metabolic alterations; primary function; protein biomarkers; scale-free property; signaling pathway aberrant activation; small world property; vascular endothelial growth factor; Bayesian network; data integration; hepatocellular carcinoma; systems biology;
Conference_Titel :
Genomic Signal Processing and Statistics, (GENSIPS), 2012 IEEE International Workshop on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4673-5234-5
DOI :
10.1109/GENSIPS.2012.6507715