Title :
Multi-Class L2,1-Norm Support Vector Machine
Author :
Cai, Xiao ; Nie, Feiping ; Huang, Heng ; Ding, Chris
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
Abstract :
Feature selection is an essential component of data mining. In many data analysis tasks where the number of data point is much less than the number of features, efficient feature selection approaches are desired to extract meaningful features and to eliminate redundant ones. In the previous study, many data mining techniques have been applied to tackle the above challenging problem. In this paper, we propose a new ℓ2,1-norm SVM, that is, multi-class hinge loss with a structured regularization term for all the classes to naturally select features for multi-class without bothering further heuristic strategy. Rather than directly solving the multi-class hinge loss with ℓ2,1-norm regularization minimization, which has not been solved before due to its optimization difficulty, we are the first to give an efficient algorithm bridging the new problem with a previous solvable optimization problem to do multi-class feature selection. A global convergence proof for our method is also presented. Via the proposed efficient algorithm, we select features across multiple classes with jointly sparsity, i.e., each feature has either small or large score over all classes. Comprehensive experiments have been performed on six bioinformatics data sets to show that our method can obtain better or competitive performance compared with exiting state-of-art multi-class feature selection approaches.
Keywords :
bioinformatics; data analysis; data mining; optimisation; support vector machines; bioinformatics data sets; data analysis tasks; data mining; multiclass L2,1-norm support vector machine; multiclass feature selection; multiclass hinge loss; optimization difficulty; solvable optimization problem; structured regularization term; Convergence; Data mining; Fasteners; Feature extraction; Lungs; Optimization; Support vector machines; Support Vector Machine; feature selection; multi-class feature selection;
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
Print_ISBN :
978-1-4577-2075-8
DOI :
10.1109/ICDM.2011.105