DocumentCode :
2458988
Title :
Microarray data feature selection using hybrid genetic algorithm simulated annealing
Author :
Perez, Meir ; Marwala, Tshilidzi
Author_Institution :
Sch. of Electr. & Electron. Eng., Univ. of Johannesburg, Johannesburg, South Africa
fYear :
2012
fDate :
14-17 Nov. 2012
Firstpage :
1
Lastpage :
5
Abstract :
Microarray data feature selection is crucial for the development of a viable cancer diagnostic system based on microarray data. This paper assesses the effectiveness of the Hybrid Genetic Algorithm Simulated Annealing (HGASA) algorithm in selecting features for various classification architectures. HGASA combines the parallel search capability of Genetic Algorithm (GA) with the flexibility of Simulated Annealing (SA). The algorithm is guided by Separability Index, which quantifies the extent of class separability demonstrated by a combination of features. Four classifiers are used in the assessment: Artificial Neural Network (ANN), Support Vector Machine (SVM), Naïve Bayesian Classifier (NBC) and K-Nearest Neighbour (KNN) classifier. Results from HGSA is compared to those from standard GA as well as to those from Population based incremental Learning (PBIL) algorithm. Two data sets are used facilitate this analysis: a prostate cancer data set and a lymphoma data set. For the prostate cancer data set, features selected by the HGASA attained the highest classification accuracy on the SVM classifier with an accuracy of 88%. For the Lymphoma data set, the highest classification accuracy was attained using the ANN classifier, which attained an accuracy of 95%. The performance of the HGASA is ascribed to its ability to search the feature space more thoroughly by employing a deeper exploration of the feature space, when compared to GA and PBIL.
Keywords :
cancer; genetic algorithms; lab-on-a-chip; learning (artificial intelligence); medical computing; neural nets; patient diagnosis; pattern classification; simulated annealing; support vector machines; ANN classifier; HGASA algorithm; K-nearest neighbour; KNN classifier; NBC classifier; Naïve Bayesian classifier; PBIL algorithm; SVM classifier; artificial neural network; hybrid genetic algorithm simulated annealing; lymphoma data set; microarray classification architectures; microarray data feature selection; population based incremental learning algorithm; prostate cancer data set; separability index; support vector machine; viable cancer diagnostic system; Accuracy; Artificial neural networks; Cancer; Classification algorithms; Genetic algorithms; Silicon; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical & Electronics Engineers in Israel (IEEEI), 2012 IEEE 27th Convention of
Conference_Location :
Eilat
Print_ISBN :
978-1-4673-4682-5
Type :
conf
DOI :
10.1109/EEEI.2012.6377146
Filename :
6377146
Link To Document :
بازگشت