DocumentCode :
2625481
Title :
Applying Data Mining Techniques for Cancer Classification from Gene Expression Data
Author :
Yeh, Jinn-Yi ; Wu, Tai-Shi ; Wu, Min-Che ; Chang, Der-Ming
Author_Institution :
Nat. Chiayi Univ., Chiayi
fYear :
2007
fDate :
21-23 Nov. 2007
Firstpage :
703
Lastpage :
708
Abstract :
Recent studies on molecular level classification of tissues have produced remarkable results, and indicated that gene expression assays could significantly aid in the development of efficient cancer diagnosis and classification platforms. However, cancer classification based on the DNA array data is still a difficult problem. The main challenge is the overwhelming number of genes relative to the number of training samples. It makes accurate classification of data more difficult. This paper applies genetic algorithms (GA) with an initial solution provided by t- statistics (t-GA) for selecting a group of relevant genes from cancer microarray data. The decision tree based cancer classifier is then built on top of these selected genes. The performance of this approach is evaluated by comparing with other gene selection methods using the publicly available gene expression datasets. Experimental results indicate that t-GA has the highest accurate rate among different methods. The Z-score figure also shows that the gene selection operation provided by t-GA is reproducible.
Keywords :
DNA; cancer; data mining; genetic algorithms; DNA; cancer classification; data mining; gene expression data; genetic algorithms; Cancer; Classification tree analysis; DNA; Data mining; Decision trees; Fluorescence; Gene expression; Genetic algorithms; RNA; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Convergence Information Technology, 2007. International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
0-7695-3038-9
Type :
conf
DOI :
10.1109/ICCIT.2007.153
Filename :
4420341
Link To Document :
بازگشت