DocumentCode :
2551798
Title :
Scalable, Efficient, Stepwise-Optimal Feature Elimination in Support Vector Machines
Author :
Aksu, Yaman ; Kesidis, George ; Miller, David J.
Author_Institution :
Pennsylvania State Univ., University Park
fYear :
2007
fDate :
27-29 Aug. 2007
Firstpage :
75
Lastpage :
80
Abstract :
We address feature selection for support vector machines for the scenario in which the feature space is huge, i.e., 105 - 106 or more features, as may occur e.g. in a biomedical context working with 3-D (or 4-D) brain images. Feature selection in this case may be needed to improve the classifier\´s generalization performance (given limited training data), to reduce classification complexity, and/or to identify a minimum subset of features necessary for accurate classification, i.e., a set of putative "biomarkers". While there are a variety of techniques for SVM-based feature selection, many such may be unsuitable for huge feature spaces due to computational and/or memory requirements. One popular, lightweight scheme is recursive feature elimination (RFE), wherein the feature with smallest weight magnitude in the current solution is eliminated at each step. Here we propose an alternative to RFE that is stepwise superior in that it maximizes margin (in the separable case) and minimizes training error rate (in the non- separable case), rather than minimizing weight magnitude. Moreover, we formulate an algorithm that achieves this stepwise maximum margin feature elimination without requiring explicit margin evaluation for all the remaining (candidate) features - in this way, the method achieves reduced complexity. To date, we have only performed experiments on (modestly dimensioned) UC Irvine data sets, which demonstrate better classification accuracy of our scheme (both training and test) over RFE. At the workshop, we will present results on huge feature spaces, for disease classification of 3-D MRI brain images and on other data domains.
Keywords :
biomedical MRI; brain; feature extraction; image classification; medical image processing; support vector machines; 3D MRI images; biomedical images; brain images; disease classification; feature selection; stepwise maximum margin feature elimination; support vector machines; Biomarkers; Brain; Diseases; Error analysis; Filtering; Performance evaluation; Support vector machine classification; Support vector machines; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing, 2007 IEEE Workshop on
Conference_Location :
Thessaloniki
ISSN :
1551-2541
Print_ISBN :
978-1-4244-1565-6
Electronic_ISBN :
1551-2541
Type :
conf
DOI :
10.1109/MLSP.2007.4414285
Filename :
4414285
Link To Document :
بازگشت