Title :
Optimizing Performance Measures for Feature Selection
Author :
Mao, Qi ; Tsang, Ivor Wai-Hung
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
Feature selection with specific multivariate performance measures is the key to the success of many applications, such as information retrieval and bioinformatics. The existing feature selection methods are usually designed for classification error. In this paper, we present a unified feature selection framework for general loss functions. In particular, we study the novel feature selection paradigm by optimizing multivariate performance measures. The resultant formulation is a challenging problem for high-dimensional data. Hence, a two-layer cutting plane algorithm is proposed to solve this problem, and the convergence is presented. Extensive experiments on large-scale and high-dimensional real world datasets show that the proposed method outperforms l1-SVM and SVM-RFE when choosing a small subset of features, and achieves significantly improved performances over SVMperf in terms of F1-score.
Keywords :
optimisation; bioinformatics; classification error; feature selection; information retrieval; multivariate performance measure optimisation; multivariate performance measures; Accuracy; Approximation algorithms; Atmospheric measurements; Loss measurement; Particle measurements; Testing; Vectors; feature selection; multiple kernel learning; multivariate performance measure; structural SVMs;
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
Print_ISBN :
978-1-4577-2075-8
DOI :
10.1109/ICDM.2011.113