• DocumentCode
    1763604
  • Title

    A Feature Selection Method for Multivariate Performance Measures

  • Author

    Qi Mao ; Tsang, Ivor Wai-Hung

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    35
  • Issue
    9
  • fYear
    2013
  • fDate
    Sept. 2013
  • Firstpage
    2051
  • Lastpage
    2063
  • Abstract
    Feature selection with specific multivariate performance measures is the key to the success of many applications such as image retrieval and text classification. The existing feature selection methods are usually designed for classification error. In this paper, we propose a generalized sparse regularizer. Based on the proposed regularizer, we present a unified feature selection framework for general loss functions. In particular, we study the novel feature selection paradigm by optimizing multivariate performance measures. The resultant formulation is a challenging problem for high-dimensional data. Hence, a two-layer cutting plane algorithm is proposed to solve this problem, and the convergence is presented. In addition, we adapt the proposed method to optimize multivariate measures for multiple-instance learning problems. The analyses by comparing with the state-of-the-art feature selection methods show that the proposed method is superior to others. Extensive experiments on large-scale and high-dimensional real-world datasets show that the proposed method outperforms l1-SVM and SVM-RFE when choosing a small subset of features, and achieves significantly improved performances over SVMperl in terms of F1-score.
  • Keywords
    learning (artificial intelligence); support vector machines; SVM-RFE; classification error; cutting plane algorithm; general loss functions; generalized sparse regularizer; image retrieval; l1-SVM; multivariate performance measurement; state-of-the-art feature selection methods; text classification; unified feature selection framework; Convergence; Error analysis; Kernel; Loss measurement; Optimization; Support vector machines; Vectors; Feature selection; multi-instance learning; multiple kernel learning; performance measure; structural SVMs;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2012.266
  • Filename
    6389678