• DocumentCode
    3517254
  • Title

    A Three-Stage Method to Select Informative Genes from Gene Expression Data in Classifying Cancer Classes

  • Author

    Mohamad, Mohd Saberi ; Omatu, Sigeru ; Deris, Safaai ; Yoshioka, Michifumi

  • Author_Institution
    Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
  • fYear
    2010
  • fDate
    27-29 Jan. 2010
  • Firstpage
    158
  • Lastpage
    163
  • Abstract
    The process of gene selection for the cancer classification faces with a major problem due to the properties of the data such as the small number of samples compared to the huge number of genes, irrelevant genes, and noisy data. Hence, this paper aims to select a near-optimal (small) subset of informative genes that is most relevant for the cancer classification. To achieve the aim, a three-stage method has been proposed. It has three stages: (1) pre-selecting genes using a filter method; (2) optimizing the gene subset using a multi-objective hybrid method; (3) analyzing the frequency of appearance of each gene. By performing experiments on three public gene expression data sets, classification accuracies and the number of selected genes of the proposed method are better than those of other experimented methods and previous works. A list of informative genes in the final gene subsets is also presented for biological usage.
  • Keywords
    biology computing; cancer; pattern classification; cancer classes cassification; filter method; gene expression data; gene subset; informative gene selection; multiobjective hybrid method; pre-selecting genes; Cancer; Computational modeling; Computer simulation; Filters; Frequency; Gene expression; Genetic algorithms; Intelligent systems; Support vector machine classification; Support vector machines; cancer classification; gene expression data; gene selection; genetic algorithm; three-stage method;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems, Modelling and Simulation (ISMS), 2010 International Conference on
  • Conference_Location
    Liverpool
  • Print_ISBN
    978-1-4244-5984-1
  • Type

    conf

  • DOI
    10.1109/ISMS.2010.39
  • Filename
    5416103