• DocumentCode
    2328803
  • Title

    Simultaneous informative gene selection and clustering through multiobjective optimization

  • Author

    Mukhopadhyay, Anirban ; Maulik, Ujjwal ; Bandyopadhyay, Sanghamitra

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Kalyani, Kalyani, India
  • fYear
    2010
  • fDate
    18-23 July 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Clustering methods are used for unsupervised classification of tumor subclasses in microarray gene expression data sets organized in a fashion where the rows represent the tumor samples and columns represent the genes. Clustering algorithms can be very sensitive with respect to the set of features (genes) considered in the clustering process. It is important to select the set of informative and relevant genes to be used for clustering. In this article, a multiobjective genetic algorithm based technique has been proposed for performing the tasks of gene selection and fuzzy clustering simultaneously. A novel encoding technique is developed in this regard and the algorithm searches for the best cluster centers while minimizing the number of selected genes. The number of clusters is evolved automatically. The performance of the proposed technique has been illustrated on an artificial data set and compared with that of several other related feature selection/clustering approaches. Moreover its performance is demonstrated on two real life multi-class gene expression data sets viz., Brain tumor and Lung tumor data sets.
  • Keywords
    bioinformatics; encoding; genetic algorithms; genomics; medical computing; pattern classification; pattern clustering; tumours; artificial data set; encoding technique; fuzzy clustering; informative gene clustering; informative gene selection; microarray gene expression; multiobjective genetic algorithm; multiobjective optimization; tumor subclass classification; unsupervised classification; Biological cells; Cancer; Clustering algorithms; Gene expression; Indexes; Lungs; Tumors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation (CEC), 2010 IEEE Congress on
  • Conference_Location
    Barcelona
  • Print_ISBN
    978-1-4244-6909-3
  • Type

    conf

  • DOI
    10.1109/CEC.2010.5586207
  • Filename
    5586207