Title :
Biomarker Identification by Knowledge-Driven Multi-Level ICA and Motif Analysis
Author :
Chen, Li ; Wang, Chen ; Shih, Ie-Ming ; Wang, Tian-Li ; Zhang, Zhen ; Wang, Yue ; Clarke, Robert ; Hoffman, Eric ; Xuan, Jianhua
Author_Institution :
Virginia Tech, Arlington
Abstract :
Many statistical methods often fail to identify biologically meaningful biomarkers related to a specific disease under study from expression data alone. In this paper, we develop a novel strategy, namely knowledge-driven multi-level independent component analysis (ICA), to infer regulatory signals and identify biologically relevant biomarkers from microarray data. Specifically, based on multi-level clustering results and partial prior knowledge, we apply ICA to find stable disease specific linear regulatory modes and then extract associated biomarker genes. A statistical test is designed to evaluate the significance of transcription factor enrichment for extracted gene set based on motif information. The experimental results on an Rsf-1 induced microarray data set show that our knowledge-driven method can extract more biologically meaningful biomarkers with significant enrichment of transcription factors related to ovarian cancer compared to other gene selection methods with/without prior knowledge.
Keywords :
biology computing; data analysis; diseases; independent component analysis; pattern clustering; biologically relevant biomarkers; biomarker identification; diseases; knowledge-driven multilevel ICA; knowledge-driven multilevel independent component analysis; microarray data; motif analysis; multilevel clustering; transcription factor; Biomarkers; Cancer; Data mining; Diseases; Genetics; Independent component analysis; Mathematical model; Matrix decomposition; Oncology; Testing;
Conference_Titel :
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location :
Cincinnati, OH
Print_ISBN :
978-0-7695-3069-7
DOI :
10.1109/ICMLA.2007.58