Title :
Identifying significant genes from microarray data
Author :
Chuang, Han-Yu ; Liu, Hongfang ; Brown, Stuart ; McMunn-Coffran, Cameron ; Kao, Cheng-Yan ; Hsu, D. Frank
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
Microarray technology is a recent development in experimental molecular biology which can produce quantitative expression measurements for thousands of genes in a single, cellular mRNA sample. These many gene expression measurements form a composite profile of the sample, which can be used to differentiate samples from different classes such as tissue types or treatments. However, for the gene expression profile data obtained in a specific comparison, most likely only some of the genes will, be differentially expressed between the classes, while many other genes have similar expression levels. Selecting a list of informative differential genes from these data is important for microarray data analysis. In this paper, we describe a framework for selecting informative genes, called ranking and combination analysis (RAC), which combines various existing informative gene selection methods. We conducted experiments using three data sets and six existing feature selection methods. The results show that the RAC framework is a robust and efficient approach to identify informative gene for microarray data. The combination approach on two selecting methods almost always performed better than the less efficient individual, and in many cases, better than both. More significantly, when considering all three data sets together, the combination approach, on average, outperforms each individual feature selection method. All of these indicate that RCA might be a viable and feasible approach for the microarray gene expression analysis.
Keywords :
biology computing; data analysis; genetics; pattern classification; feature selection; gene expression profile; informative gene selection; microarray data; ranking and combination analysis; significant genes identification; Biomedical computing; Biomedical engineering; Biomedical informatics; Biomedical measurements; Computer science; Data analysis; Gene expression; Information science; Information systems; Medical diagnostic imaging;
Conference_Titel :
Bioinformatics and Bioengineering, 2004. BIBE 2004. Proceedings. Fourth IEEE Symposium on
Print_ISBN :
0-7695-2173-8
DOI :
10.1109/BIBE.2004.1317366