DocumentCode
2568314
Title
A GA-SVM feature selection model based on high performance computing techniques
Author
Zhang, Tianyou ; Fu, Xiuju ; Goh, Rick Siow Mong ; Kwoh, Chee Keong ; Lee, Gary Kee Khoon
Author_Institution
Inst. of High Performance Comput., Singapore, Singapore
fYear
2009
fDate
11-14 Oct. 2009
Firstpage
2653
Lastpage
2658
Abstract
Supervised learning is well-known and widely applied in many domains including bioinformatics, cheminformatics and financial forecasting. However, the interference from irrelevant features may lead to the poor accuracy of classifiers. As a popular feature selection model, GA-SVM is desirable in many of those cases to filter out irrelevant features and improve the learning performance subsequently. However, the high computational cost strongly discourages the application of GA-SVM in large-scale datasets. In this paper, an HPC-enabled GA-SVM (HGA-SVM) is proposed by integrating data parallelization, multithreading and heuristic techniques with the ultimate goal of robustness and low computational cost. Our proposed model is comprised of four improvement strategies: 1) GA parallelization, 2) SVM parallelization, 3) neighbor search and 4) evaluation caching. All the four strategies improve various aspects of the feature selection model and contribute collectively towards higher computational throughput.
Keywords
data analysis; genetic algorithms; learning (artificial intelligence); multi-threading; support vector machines; GA parallelization; GA-SVM; SVM parallelization; data parallelization; evaluation caching; feature selection model; genetic algorithm; heuristic techniques; high performance computing techniques; large-scale datasets; multithreading; neighbor search; supervised learning; support vector machine; Bioinformatics; Computational efficiency; Filters; Genetic algorithms; High performance computing; Interference; Large-scale systems; Multithreading; Supervised learning; Support vector machines; HPC; genetic algorithm; support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
Conference_Location
San Antonio, TX
ISSN
1062-922X
Print_ISBN
978-1-4244-2793-2
Electronic_ISBN
1062-922X
Type
conf
DOI
10.1109/ICSMC.2009.5346120
Filename
5346120
Link To Document