DocumentCode :
241189
Title :
A novel approach for automatic gene selection and classification of gene based colon cancer datasets
Author :
Rathore, Saima ; Iftikhar, Muhammad Aksam ; Hussain, Mutawarra
Author_Institution :
DCIS, Pakistan Inst. of Eng. & Appl. Sci., Islamabad, Pakistan
fYear :
2014
fDate :
8-9 Dec. 2014
Firstpage :
42
Lastpage :
47
Abstract :
Colon cancer markedly changes the composition of human genes (expressions). This deviation in the chemical composition of genes can be exploited to diagnose colon cancer automatically. The major challenge in analyzing human gene based datasets is their high dimensionality, so efficient techniques are needed to select discriminative genes. In this article, we propose a novel classification technique that exploits variations in gene expression to classify colon gene samples into normal and malignant classes while handling the high dimensionality of gene based datasets. Individual feature selection techniques have previously been used to select discriminative gene expressions, but their performance is limited. We therefore propose a feed-forward gene selection technique in which two feature selection techniques are applied one after the other: the genes selected by the first technique are fed as input to the second, which selects a further subset from them. The selected genes are then classified using a support vector machine (SVM) with a linear kernel. The proposed technique has been tested on three standard colon cancer datasets, and the feed-forward approach to gene selection has shown improved performance. It also substantially reduces the size of gene based datasets, thereby reducing computation time. Compared with existing techniques for colon cancer diagnosis, the proposed technique likewise shows improved performance. We therefore hope that the proposed technique can be used effectively for the diagnosis of colon cancer.
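The two-stage selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not name the two techniques or their order; the keywords list Chi-Square and mRMR, so the sketch assumes a chi-square filter first and an mRMR-style greedy step second. The function names, the median-based binarization, the use of absolute Pearson correlation as a stand-in for mutual information, and the toy data are all hypothetical.

```python
from statistics import mean, median, pstdev


def chi2_score(gene, labels):
    """Chi-square statistic of a 2x2 table: expression above/below the
    gene's median versus class label (0 = normal, 1 = malignant)."""
    cut = median(gene)
    table = [[0, 0], [0, 0]]
    for x, y in zip(gene, labels):
        table[int(x > cut)][y] += 1
    n = len(gene)
    score = 0.0
    for i in range(2):
        for j in range(2):
            expected = sum(table[i]) * (table[0][j] + table[1][j]) / n
            if expected > 0:
                score += (table[i][j] - expected) ** 2 / expected
    return score


def abs_corr(a, b):
    """Absolute Pearson correlation; 0.0 if either vector is constant.
    Assumption: used here as a simple proxy for mutual information."""
    sa, sb = pstdev(a), pstdev(b)
    if sa == 0 or sb == 0:
        return 0.0
    ma, mb = mean(a), mean(b)
    cov = mean((x - ma) * (y - mb) for x, y in zip(a, b))
    return abs(cov / (sa * sb))


def mrmr_like(genes, labels, candidates, k):
    """Greedy mRMR-style step: repeatedly pick the candidate with the best
    (relevance to the label) minus (mean redundancy with genes already picked)."""
    selected, remaining = [], list(candidates)

    def gain(g):
        relevance = abs_corr(genes[g], labels)
        if not selected:
            return relevance
        redundancy = mean(abs_corr(genes[g], genes[s]) for s in selected)
        return relevance - redundancy

    while remaining and len(selected) < k:
        best = max(remaining, key=gain)
        selected.append(best)
        remaining.remove(best)
    return selected


def feed_forward_select(genes, labels, k1, k2):
    """Stage 1 keeps the k1 genes with the highest chi-square score;
    stage 2 selects k2 genes from that subset only."""
    ranked = sorted(range(len(genes)),
                    key=lambda g: chi2_score(genes[g], labels),
                    reverse=True)
    return mrmr_like(genes, labels, ranked[:k1], k2)


# Hypothetical toy data: one list of expression values per gene, six samples.
LABELS = [0, 0, 0, 1, 1, 1]
GENES = [
    [1, 2, 1, 8, 9, 8],  # gene 0: separates the classes
    [1, 2, 1, 8, 9, 8],  # gene 1: exact duplicate of gene 0 (redundant)
    [5, 1, 5, 1, 5, 1],  # gene 2: weakly related to the label
    [3, 4, 3, 4, 3, 4],  # gene 3: weakly related to the label
]

print(feed_forward_select(GENES, LABELS, k1=3, k2=2))
```

In a full pipeline, the expression columns returned by `feed_forward_select` would then be passed to a linear-kernel SVM (for example scikit-learn's `SVC(kernel="linear")`), which is omitted here to keep the sketch dependency-free.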
Keywords :
cancer; feature selection; genetics; medical computing; patient diagnosis; pattern classification; support vector machines; automatic gene classification; automatic gene selection; colon cancer diagnosis; colon gene sample classification; computational time reduction; discerning gene expression selection; expression composition deviation; feature selection techniques; feed forward gene selection technique; gene based colon cancer datasets; gene chemical composition; human gene based dataset analysis; human gene composition; Accuracy; Colon; Genetic expression; Kernel; Vectors; Chi-Square; Colon cancer; Gene expressions; mRMR
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Emerging Technologies (ICET), 2014 International Conference on
Conference_Location :
Islamabad
Print_ISBN :
978-1-4799-6088-0
Type :
conf
DOI :
10.1109/ICET.2014.7021014
Filename :
7021014