• DocumentCode
    1420560
  • Title

    A Unified Feature and Instance Selection Framework Using Optimum Experimental Design

  • Author

    Zhang, Lijun ; Chen, Chun ; Bu, Jiajun ; He, Xiaofei

  • Author_Institution
    Zhejiang Provincial Key Lab. of Service Robot, Zhejiang Univ., Hangzhou, China
  • Volume
    21
  • Issue
    5
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    2379
  • Lastpage
    2388
  • Abstract
    The goal of feature selection is to identify the most informative features for compact representation, whereas the goal of active learning is to select the most informative instances for prediction. Previous studies separately address these two problems, despite of the fact that selecting features and instances are dual operations over a data matrix. In this paper, we consider the novel problem of simultaneously selecting the most informative features and instances and develop a solution from the perspective of optimum experimental design. That is, by using the selected features as the new representation and the selected instances as training data, the variance of the parameter estimate of a learning function can be minimized. Specifically, we propose a novel approach, which is called Unified criterion for Feature and Instance selection (UFI), to simultaneously identify the most informative features and instances that minimize the trace of the parameter covariance matrix. A greedy algorithm is introduced to efficiently solve the optimization problem. Experimental results on two benchmark data sets demonstrate the effectiveness of our proposed method.
  • Keywords
    covariance matrices; data structures; feature extraction; greedy algorithms; image representation; learning (artificial intelligence); minimisation; parameter estimation; active learning; data matrix; data representation; feature identification; feature representation; greedy algorithm; instance selection; minimization; parameter covariance matrix; parameter estimation; unified feature selection; Accuracy; Algorithm design and analysis; Covariance matrix; Optimization; Support vector machines; Training; US Department of Defense; Active learning; experimental design; feature selection; instance selection; Algorithms; Artificial Intelligence; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2012.2183879
  • Filename
    6129509