• DocumentCode
    464234
  • Title

    Application of Double Clustering to Gene Expression Data for Class Prediction

  • Author

    Al-Shalalfa, Mohammed ; Alhajj, Reda

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Calgary, Calgary, AB
  • Volume
    1
  • fYear
    2007
  • fDate
    21-23 May 2007
  • Firstpage
    733
  • Lastpage
    738
  • Abstract
    Extracting significant features from gene expression data is a hot subject that continues to receive great attention. Many methods have been proposed in the literature to deal with this issue, but all of these methods deal with features obtained directly from the data. Since microarray data exhibit a high degree of noise, in this paper we try to reduce the noise by using double clustering approach to identify reduced set of features capable of distinguishing between two classes. Also, we showed that the transformation of the data plays a significant role in classification. We have used two forms of data, and we have used k-means and self organizing map for clustering. Support vector machine and binary decision trees are used for classification. As a result of the conducted experiments on AML/ALL data, we have observed that CSVM is able to correctly classify the whole training and testing data when the data is log2 transformed using only few features.
  • Keywords
    binary decision diagrams; biology computing; decision trees; pattern classification; pattern clustering; self-organising feature maps; support vector machines; binary decision trees; double clustering approach; gene expression; k-means clustering; self organizing map; support vector machine; Classification tree analysis; Data mining; Decision trees; Feature extraction; Gene expression; Noise reduction; Organizing; Support vector machine classification; Support vector machines; Testing; classification; clustering; feature reduction; microarray; support vector machine.;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications Workshops, 2007, AINAW '07. 21st International Conference on
  • Conference_Location
    Niagara Falls, Ont.
  • Print_ISBN
    978-0-7695-2847-2
  • Type

    conf

  • DOI
    10.1109/AINAW.2007.97
  • Filename
    4221145