Title :
Attractive Feature Reduction Approach for Colon Data Classification
Author :
Al-Shalalfa, Mohammed ; Alhajj, Reda
Author_Institution :
Dept. of Comput. Sci., Univ. of Calgary, Calgary, AB
Abstract :
In this paper, we try to identify a set of reduced features capable of distinguishing between two classes by performing double clustering using fuzzy c-means. We decided on using fuzzy c-means because a fuzzy model fits better the gene expression data analysis. Fuzziness parameter m is a major problem in applying fuzzy c- means method for clustering. In this approach, we applied fuzzy c-means clustering using different fuzziness parameters for two forms of microarray data. Support vector machine with different kernel functions are used for classification. As a result of the experiments conducted on the colon dataset, we have observed that CSVM is able to correctly classify the whole training and test sets when the data is log2 transformed and when in is close to 1.5.
Keywords :
biology computing; fuzzy set theory; pattern classification; pattern clustering; support vector machines; colon data classification; double clustering; feature reduction approach; fuzzy c-means clustering; gene expression data analysis; microarray data; support vector machine; Cancer; Clustering methods; Colon; Computer science; Data analysis; Diseases; Gene expression; Monitoring; Support vector machine classification; Support vector machines; Clustering; Fuzzy C-means(FCM).; classification; fuzziness parameter; microarray; support vector machine; validity analysis;
Conference_Titel :
Advanced Information Networking and Applications Workshops, 2007, AINAW '07. 21st International Conference on
Conference_Location :
Niagara Falls, Ont.
Print_ISBN :
978-0-7695-2847-2
DOI :
10.1109/AINAW.2007.103