• DocumentCode
    775422
  • Title

    Efficient Sparse Kernel Feature Extraction Based on Partial Least Squares

  • Author

    Dhanjal, Charanpal ; Gunn, Steve R. ; Shawe-Taylor, John

  • Author_Institution
    Inf.: Signals Images Syst. Res. Group, Univ. of Southampton, Southampton
  • Volume
    31
  • Issue
    8
  • fYear
    2009
  • Firstpage
    1347
  • Lastpage
    1361
  • Abstract
    The presence of irrelevant features in training data is a significant obstacle for many machine learning tasks. One approach to this problem is to extract appropriate features and, often, one selects a feature extraction method based on the inference algorithm. Here, we formalize a general framework for feature extraction, based on partial least squares, in which one can select a user-defined criterion to compute projection directions. The framework draws together a number of existing results and provides additional insights into several popular feature extraction methods. Two new sparse kernel feature extraction methods are derived under the framework, called sparse maximal alignment (SMA) and sparse maximal covariance (SMC), respectively. Key advantages of these approaches include simple implementation and a training time which scales linearly in the number of examples. Furthermore, one can project a new test example using only k kernel evaluations, where k is the output dimensionality. Computational results on several real-world data sets show that SMA and SMC extract features which are as predictive as those found using other popular feature extraction methods. Additionally, on large text retrieval and face detection data sets, they produce features which match the performance of the original ones in conjunction with a support vector machine.
  • Keywords
    covariance matrices; face recognition; feature extraction; image retrieval; inference mechanisms; learning (artificial intelligence); least squares approximations; sparse matrices; support vector machines; text analysis; face detection; inference algorithm; machine learning; partial least squares; sparse kernel feature extraction method; sparse maximal alignment; sparse maximal covariance; support vector machine; text retrieval; training data; Feature extraction or construction; Machine learning; feature extraction; kernel methods; partial least squares (PLS).; Algorithms; Artificial Intelligence; Databases, Factual; Face; Humans; Least-Squares Analysis; Pattern Recognition, Automated; Principal Component Analysis; ROC Curve;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2008.171
  • Filename
    4553716