Author/Authors :
Yousefi Moteghaed، Niloofar نويسنده Department of Biomedical Engineering, Science and Research Branch, Islamic Azad University , , Maghooli، Keivan نويسنده Department of Biomedical Engineering, Science and Research Branch, Islamic Azad University , , Pirhadi، Shiva نويسنده Department of Biomedical Engineering, Science and Research Branch, Islamic Azad University , , Garshasbi، Masoud نويسنده Department of Medical Genetics, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran ,
Abstract :
The improvement of high through put gene profiling based microarrays technology has provided monitoring the expression value
of thousands of genes simultaneously. Detailed examination of changes in expression levels of genes can help physicians to have
efficient diagnosing, classification of tumors and cancer’s types as well as effective treatments. Finding genes that can classify the
group of cancers correctly based on hybrid optimization algorithms is the main purpose of this paper. In this paper, a hybrid particle
swarm optimization and genetic algorithm method are used for gene selection and also artificial neural network (ANN) is adopted
as the classifier. In this work, we have improved the ability of the algorithm for the classification problem by finding small group of
biomarkers and also best parameters of the classifier. The proposed approach is tested on three benchmark gene expression data
sets: Blood (acute myeloid leukemia, acute lymphoblastic leukemia), colon and breast datasets. We used 10 fold cross validation to
achieve accuracy and also decision tree algorithm to find the relation between the biomarkers for biological point of view. To test the
ability of the trained ANN models to categorize the cancers, we analyzed additional blinded samples that were not previously used for
the training procedure. Experimental results show that the proposed method can reduce the dimension of the data set and confirm the
most informative gene subset and improve classification accuracy with best parameters based on datasets.