Title :
Ensemble-based classifiers for cancer classification using human tumor microarray data
Author :
Margoosian, Argin ; Abouei, J.
Author_Institution :
Dept. of Electr. & Comput. Eng., Yazd Univ., Yazd, Iran
Abstract :
In this paper, two cancer classification techniques based on multicategory microarray data sets are presented. Due to the high dimensionality of microarray data sets, choosing reliable feature selection and classification algorithms with a high degree of accuracy and a low complexity is a crucial task in bioinformatics. Toward this goal, this paper aims to maximize the cancer classification accuracy using two reliable ensemble-based classifiers namely the ensemble of naive bayes and the ensemble of k-nearest neighbor. Simulation results show that our classifiers have considerably better accuracy than some conventional classification techniques such as the Support Vector Machine (SVM) and artificial neural networks in the field of multicategory microarray cancer classification based on fourteen cancer data set. However, the run time of the introduced ensemble-based classifiers is longer when the schemes use whole features. To reduce the time complexity while preserving the same classification accuracy as before, we use the recursive feature elimination based on the multiple support vector machine classifier to select more informative genes before applying the ensemble-based classifiers. Numerical evaluations show at least 30% improvement in the classification accuracy of our schemes when compared to the SVM-one versus one rule. In addition, our schemes are much more robust to the feature elimination and display a high accuracy in the case of low number of features.
Keywords :
Bayes methods; bioinformatics; cancer; genetics; genomics; medical computing; numerical analysis; support vector machines; tumours; artificial neural network; bioinformatics; cancer classification technique; ensemble-based classifier; feature classification algorithm; feature selection algorithm; gene selection; human tumor microarray data; k-nearest neighbor ensemble; multicategory microarray data set; naive bayes ensemble; numerical evaluation; recursive feature elimination; support vector machine; Accuracy; Cancer; Computational complexity; Simulation; Support vector machines; Training; Cancer classification; ensemble-based methods; microarray data set; naive bayes classifier; recursive feature elimination;
Conference_Titel :
Electrical Engineering (ICEE), 2013 21st Iranian Conference on
Conference_Location :
Mashhad
DOI :
10.1109/IranianCEE.2013.6599553