• DocumentCode
    2568314
  • Title

    A GA-SVM feature selection model based on high performance computing techniques

  • Author

    Zhang, Tianyou ; Fu, Xiuju ; Goh, Rick Siow Mong ; Kwoh, Chee Keong ; Lee, Gary Kee Khoon

  • Author_Institution
    Inst. of High Performance Comput., Singapore, Singapore
  • fYear
    2009
  • fDate
    11-14 Oct. 2009
  • Firstpage
    2653
  • Lastpage
    2658
  • Abstract
    Supervised learning is well-known and widely applied in many domains including bioinformatics, cheminformatics and financial forecasting. However, the interference from irrelevant features may lead to the poor accuracy of classifiers. As a popular feature selection model, GA-SVM is desirable in many of those cases to filter out irrelevant features and improve the learning performance subsequently. However, the high computational cost strongly discourages the application of GA-SVM in large-scale datasets. In this paper, an HPC-enabled GA-SVM (HGA-SVM) is proposed by integrating data parallelization, multithreading and heuristic techniques with the ultimate goal of robustness and low computational cost. Our proposed model is comprised of four improvement strategies: 1) GA parallelization, 2) SVM parallelization, 3) neighbor search and 4) evaluation caching. All the four strategies improve various aspects of the feature selection model and contribute collectively towards higher computational throughput.
  • Keywords
    data analysis; genetic algorithms; learning (artificial intelligence); multi-threading; support vector machines; GA parallelization; GA-SVM; SVM parallelization; data parallelization; evaluation caching; feature selection model; genetic algorithm; heuristic techniques; high performance computing techniques; large-scale datasets; multithreading; neighbor search; supervised learning; support vector machine; Bioinformatics; Computational efficiency; Filters; Genetic algorithms; High performance computing; Interference; Large-scale systems; Multithreading; Supervised learning; Support vector machines; HPC; genetic algorithm; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
  • Conference_Location
    San Antonio, TX
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-2793-2
  • Electronic_ISBN
    1062-922X
  • Type

    conf

  • DOI
    10.1109/ICSMC.2009.5346120
  • Filename
    5346120