Title :
Selecting informative genes from microarray dataset by incorporating gene ontology
Author :
Xu, Xian ; Zhang, Aidong
Author_Institution :
Dept. of Comput. Sci. & Eng., State Univ. of New York, Buffalo, NY, USA
Abstract :
Selecting informative genes from microarray experiments is one of the most important data analysis steps for deciphering biological information imbedded in such experiments. However, due to the characteristics of microarray technology and the underlying biology, namely large number of genes and limited number of samples, the statistical soundness of gene selection algorithm becomes questionable. One major problem is the high false discover rate. Microarray experiment is only one facet of current knowledge of the biological system under study. In this paper, we propose to alleviate this high false discover rate problem by integrating domain knowledge into the gene selection process. Gene ontology represents a controlled biological vocabulary and a repository of computable biological knowledge. It is shown in the literature that gene ontology-based similarities between genes carry significant information of the functional relationships. Integration of such domain knowledge into gene selection algorithms enables us to remove noisy genes intelligently. We propose an add-on algorithm applied to any single gene-based discriminative scores integrating domain knowledge from gene ontology annotation. Preliminary experiments are performed on publicly available colon cancer dataset to demonstrate the utility of the integration of domain knowledge for the purpose of gene selection. Our experiments show interesting results.
Keywords :
cancer; genetics; medical computing; molecular biophysics; ontologies (artificial intelligence); colon cancer dataset; computable biological knowledge; controlled biological vocabulary; domain knowledge; gene ontology; informative gene selection; microarray dataset; Acoustic noise; Biology computing; Cancer; Classification algorithms; DNA; Data analysis; Gene expression; Machine learning algorithms; Neoplasms; Ontologies;
Conference_Titel :
Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
Print_ISBN :
0-7695-2476-1
DOI :
10.1109/BIBE.2005.51