• DocumentCode
    464226
  • Title

    Attractive Feature Reduction Approach for Colon Data Classification

  • Author

    Al-Shalalfa, Mohammed ; Alhajj, Reda

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Calgary, Calgary, AB
  • Volume
    1
  • fYear
    2007
  • fDate
    21-23 May 2007
  • Firstpage
    678
  • Lastpage
    683
  • Abstract
    In this paper, we try to identify a set of reduced features capable of distinguishing between two classes by performing double clustering using fuzzy c-means. We decided on using fuzzy c-means because a fuzzy model fits better the gene expression data analysis. Fuzziness parameter m is a major problem in applying fuzzy c- means method for clustering. In this approach, we applied fuzzy c-means clustering using different fuzziness parameters for two forms of microarray data. Support vector machine with different kernel functions are used for classification. As a result of the experiments conducted on the colon dataset, we have observed that CSVM is able to correctly classify the whole training and test sets when the data is log2 transformed and when in is close to 1.5.
  • Keywords
    biology computing; fuzzy set theory; pattern classification; pattern clustering; support vector machines; colon data classification; double clustering; feature reduction approach; fuzzy c-means clustering; gene expression data analysis; microarray data; support vector machine; Cancer; Clustering methods; Colon; Computer science; Data analysis; Diseases; Gene expression; Monitoring; Support vector machine classification; Support vector machines; Clustering; Fuzzy C-means(FCM).; classification; fuzziness parameter; microarray; support vector machine; validity analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications Workshops, 2007, AINAW '07. 21st International Conference on
  • Conference_Location
    Niagara Falls, Ont.
  • Print_ISBN
    978-0-7695-2847-2
  • Type

    conf

  • DOI
    10.1109/AINAW.2007.103
  • Filename
    4221136