Title :
Exploring novel algorithms for the prediction of cancer classification
Author :
Chen, Austin H. ; Hsu, Jen-Chieh
Abstract :
In the past decade, DNA microarray technologies have had a great impact on cancer genome research; this technology has been viewed as a promising approach in predicting cancer classes and prognosis outcomes. In this paper, we proposed two systematic methods which can predict cancer classification. By applying the genetic algorithm gene selection (GAGS) method in order to find an optimal information gene subset, we avoid the over-fitting problem caused by attempting to apply a large number of genes to a small number of samples. By extracting significant samples (which we refer to as support vector samples because they are located only on support vectors), we allow the back propagation neural network (BPNN) to learn more tasks. We call this approach the multi-task support vector sample learning (MTSVSL) technique. We demonstrate experimentally that the GAGS and MTSVSL methods yield superior classification performance with application to leukemia and prostate cancer gene expression datasets. Our proposed GAGS and MTSVSL methods are novel approaches which are expedient and perform exceptionally well in cancer diagnosis and clinical outcome prediction.
Keywords :
backpropagation; biology computing; cancer; diseases; genetic algorithms; lab-on-a-chip; neural nets; support vector machines; BPNN; DNA microarray technologies; GAGS; MTSVSL; back propagation neural network; cancer classification prediction; cancer genome research; exploring novel algorithms; genetic algorithm gene selection; multitask support vector sample learning; support vector samples; Bioinformatics; Cancer; DNA; Gene expression; Genetic algorithms; Genomics; Machine learning; Neural networks; Prediction algorithms; Support vector machines; back propagation neural networking; cancer classification; gene expression profiling; genetic algorithm gene selection; multi task learning; support vectors;
Conference_Titel :
Software Engineering and Data Mining (SEDM), 2010 2nd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-7324-3
Electronic_ISBN :
978-89-88678-22-0