DocumentCode
3517254
Title
A Three-Stage Method to Select Informative Genes from Gene Expression Data in Classifying Cancer Classes
Author
Mohamad, Mohd Saberi ; Omatu, Sigeru ; Deris, Safaai ; Yoshioka, Michifumi
Author_Institution
Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
fYear
2010
fDate
27-29 Jan. 2010
Firstpage
158
Lastpage
163
Abstract
The process of gene selection for the cancer classification faces with a major problem due to the properties of the data such as the small number of samples compared to the huge number of genes, irrelevant genes, and noisy data. Hence, this paper aims to select a near-optimal (small) subset of informative genes that is most relevant for the cancer classification. To achieve the aim, a three-stage method has been proposed. It has three stages: (1) pre-selecting genes using a filter method; (2) optimizing the gene subset using a multi-objective hybrid method; (3) analyzing the frequency of appearance of each gene. By performing experiments on three public gene expression data sets, classification accuracies and the number of selected genes of the proposed method are better than those of other experimented methods and previous works. A list of informative genes in the final gene subsets is also presented for biological usage.
Keywords
biology computing; cancer; pattern classification; cancer classes cassification; filter method; gene expression data; gene subset; informative gene selection; multiobjective hybrid method; pre-selecting genes; Cancer; Computational modeling; Computer simulation; Filters; Frequency; Gene expression; Genetic algorithms; Intelligent systems; Support vector machine classification; Support vector machines; cancer classification; gene expression data; gene selection; genetic algorithm; three-stage method;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems, Modelling and Simulation (ISMS), 2010 International Conference on
Conference_Location
Liverpool
Print_ISBN
978-1-4244-5984-1
Type
conf
DOI
10.1109/ISMS.2010.39
Filename
5416103
Link To Document