• DocumentCode
    1863254
  • Title

    A hybrid approach for selecting gene subsets using gene expression data

  • Author

    Yang, Chang-san ; Chuang, Li-Yeh ; Ke, Chao-hsuan ; Yang, Cheng-Hong

  • Author_Institution
    Inst. of Biomed. Eng., Tainan
  • fYear
    2008
  • fDate
    25-27 June 2008
  • Firstpage
    159
  • Lastpage
    164
  • Abstract
    Many previous research papers have demonstrated that microarray gene expression data are useful for disease classification and medical diagnosis. Cancer microarray data normally have a particular characteristic where features (genes) greatly exceed the instance (tissue sample) numbers. Selecting appropriate numbers and relevant features to differentiate different types of cancer remains a challenge in bioinformatics. In order to select useful gene sets from microarray data to promote classification performance effectively, feature selection approaches were included in many previous literature reports. In this paper, a hybrid approach which combines correlation-based feature selection and binary particle swarm optimization was used to select few subsets, combined with the K-nearest neighbor method as a classifier to evaluate the classification performance. The proposed approach is applied on six microarray gene expression data sets that relate to human cancer. The experimental results show that the proposed approach selects a smaller number of feature subsets and obtains better classification accuracy.
  • Keywords
    bioinformatics; cancer; feature extraction; genetics; medical diagnostic computing; particle swarm optimisation; pattern classification; tumours; K-nearest neighbor method; binary particle swarm optimization; bioinformatics; cancer microarray data; correlation-based feature selection; disease classification; gene subset selection; medical diagnosis; microarray gene expression data; Computer applications; Computer industry; Gene expression; Feature selection; Microarray gene expression data; binary particle swarm optimization; correlation-based feature selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing in Industrial Applications, 2008. SMCia '08. IEEE Conference on
  • Conference_Location
    Muroran
  • Print_ISBN
    978-1-4244-3782-5
  • Electronic_ISBN
    978-4-9904-2590-6
  • Type

    conf

  • DOI
    10.1109/SMCIA.2008.5045953
  • Filename
    5045953