• DocumentCode
    241189
  • Title

    A novel approach for automatic gene selection and classification of gene based colon cancer datasets

  • Author

    Rathore, Saima ; Iftikhar, Muhammad Aksam ; Hussain, Mutawarra

  • Author_Institution
    DCIS, Pakistan Inst. of Eng. & Appl. Sci., Islamabad, Pakistan
  • fYear
    2014
  • fDate
    8-9 Dec. 2014
  • Firstpage
    42
  • Lastpage
    47
  • Abstract
    Colon cancer heavily changes the composition of human genes (expressions). The deviation in the chemical composition of genes can be exploited to automatically diagnose colon cancer. The major challenge in the analysis of human gene based datasets is their large dimensionality. Therefore, efficient techniques are needed to select discerning genes. In this research article, we propose a novel classification technique that exploits the variations in gene expressions for classifying colon gene samples into normal and malignant classes, and quite intelligently tackles the larger dimensionality of gene based datasets. Previously individual feature selection techniques have been used for selection of discerning gene expressions, however, their performance is limited. In this research study, we propose a feed forward gene selection technique, wherein, two feature selection techniques are used one after the other. The genes selected by the first technique are fed as input to the second feature selection technique that selects genes from the given gene subset. The selected genes are then classified by using linear kernel of support vector machines (SVM). The feed forward approach of gene selection has shown improved performance. The proposed technique has been tested on three standard colon cancer datasets, and improved performance has been observed. It is observed that feed forward method of gene selection substantially reduces the size of gene based datasets, thereby reducing the computational time to a great extent. Performance of the proposed technique has also been compared with existing techniques of colon cancer diagnosis, and improved performance has been observed. Therefore, we hope that the proposed technique can be effectively used for diagnosis of colon cancer.
  • Keywords
    cancer; feature selection; genetics; medical computing; patient diagnosis; pattern classification; support vector machines; automatic gene classification; automatic gene selection; colon cancer diagnosis; colon cancer heavily; colon gene sample classification; computational time reduction; discerning gene expression selection; expression composition deviation; feature selection techniques; feed forward gene selection technique; gene based colon cancer datasets; gene chemical composition; human gene based dataset analysis; human gene composition; support vector machines; Accuracy; Cancer; Colon; Genetic expression; Kernel; Support vector machines; Vectors; Chi-Square; Colon cancer; Gene expressions; mRMR;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Emerging Technologies (ICET), 2014 International Conference on
  • Conference_Location
    Islamabad
  • Print_ISBN
    978-1-4799-6088-0
  • Type

    conf

  • DOI
    10.1109/ICET.2014.7021014
  • Filename
    7021014