Title :
Selecting Informative Genes from Microarray Data for Cancer Classification with Genetic Programming Classifier Using K-Means Clustering and SNR Ranking
Author :
Hengpraprohm, Supoj ; Chongstitvatana, Prabhas
Author_Institution :
Dept. of Comput. Eng., Chulalongkorn Univ., Chulalongkorn
Abstract :
This paper presents a method for selecting informative features using K-Means clustering and SNR ranking. The performance of the proposed method was tested on cancer classification problems. Genetic Programming is employed as a classifier. The experimental results indicate that the proposed method yields higher accuracy than using the SNR ranking alone and higher than using all of the genes in classification. The clustering step assures that the selected genes have low redundancy, hence the classifier can exploit these features to obtain better performance.
Keywords :
biology computing; cancer; genetics; K-means clustering; SNR ranking; bioinformatics; cancer classification; genetic programming classifier; microarray data; selecting informative genes; Cancer; Clustering algorithms; DNA; Data analysis; Data engineering; Genetic programming; Information technology; Organisms; Partitioning algorithms; Testing;
Conference_Titel :
Frontiers in the Convergence of Bioscience and Information Technologies, 2007. FBIT 2007
Conference_Location :
Jeju City
Print_ISBN :
978-0-7695-2999-8
DOI :
10.1109/FBIT.2007.84