• DocumentCode
    464271
  • Title

    Boosting Evolutionary Support Vector Machine for Designing Tumor Classifiers from Microarray Data

  • Author

    Huang, Hui-Ling ; Chen, Yi-Hsiung ; Koeberl, Dwight D. ; Ho, Shinn-Ying

  • Author_Institution
    Dept. of Inf. Manage., Jin Wen Inst. of Technol., Taipei
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    32
  • Lastpage
    38
  • Abstract
    Since there are multiple sets of relevant genes having the same high accuracy in fitting training data called model uncertainty, to identify a small set of informative genes from microarray data for designing an accurate tumor classifier for unknown samples is intractable. Support vector machine (SVM), a supervised machine learning technique, is one of the methods successfully applied to cancer diagnosis problems. This study proposes an SVM-based classifier with automatic feature selection associated with a boosting strategy. The proposed boosting evolutionary support vector machine (named BESVM) hybridizes the advantages of SVM, boosting using a majority-voting ensemble and an intelligent genetic algorithm for gene selection. The merits of the BESVM-based classifier are threefold: 1) a small set of used genes, 2) accurate test classification using leave-one-out cross-validation, and 3) robust performance by avoiding overfitting training data. Five benchmark datasets were used to evaluate the BESVM-based classifier. Simulation results reveal that BESVM performs well having a mean accuracy 94.26% using only 10.1 genes averagely, compared with the existing SVM and non-SVM based classifiers
  • Keywords
    biology computing; genetic algorithms; pattern classification; support vector machines; tumours; automatic feature selection; boosting evolutionary support vector machine; gene selection; intelligent genetic algorithm; majority-voting ensemble; microarray data; model uncertainty; supervised machine learning; tumor classifiers; Boosting; Cancer; Genetic algorithms; Learning systems; Machine learning; Neoplasms; Support vector machine classification; Support vector machines; Training data; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0710-9
  • Type

    conf

  • DOI
    10.1109/CIBCB.2007.4221201
  • Filename
    4221201