• DocumentCode
    20757
  • Title
    Sparse Extreme Learning Machine for Classification
  • Author
    Zuo Bai ; Guang-Bin Huang ; Danwei Wang ; Han Wang ; M. Brandon Westover
  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    44
  • Issue
    10
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    1858
  • Lastpage
    1870
  • Abstract
    Extreme learning machine (ELM) was initially proposed for single-hidden-layer feedforward neural networks (SLFNs). In the hidden layer (feature mapping), nodes are randomly generated independently of training data. Furthermore, a unified ELM was proposed, providing a single framework to simplify and unify different learning methods, such as SLFNs, least square support vector machines, proximal support vector machines, and so on. However, the solution of unified ELM is dense, and thus, usually plenty of storage space and testing time are required for large-scale applications. In this paper, a sparse ELM is proposed as an alternative solution for classification, reducing storage space and testing time. In addition, unified ELM obtains the solution by matrix inversion, whose computational complexity is between quadratic and cubic with respect to the training size. It still requires plenty of training time for large-scale problems, even though it is much faster than many other traditional methods. In this paper, an efficient training algorithm is specifically developed for sparse ELM. The quadratic programming problem involved in sparse ELM is divided into a series of smallest possible sub-problems, each of which is solved analytically. Compared with SVM, sparse ELM obtains better generalization performance with much faster training speed. Compared with unified ELM, sparse ELM achieves similar generalization performance for binary classification applications, and when dealing with large-scale binary classification problems, sparse ELM realizes even faster training speed than unified ELM.
  • Keywords
    generalisation (artificial intelligence); learning (artificial intelligence); matrix algebra; pattern classification; quadratic programming; ELM; SLFN; binary classification applications; cubic computational complexity; data classification solution; generalization performance; learning methods; least square support vector machines; matrix inversion; proximal support vector machines; quadratic computational complexity; quadratic programming problem; single-hidden-layer feedforward neural networks; sparse extreme learning machine; Kernel; Optimization; Sparse matrices; Support vector machines; Testing; Training; Training data; Classification; extreme learning machine (ELM); quadratic programming (QP); sequential minimal optimization (SMO); sparse ELM; support vector machine (SVM); unified ELM;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    2168-2267
  • Type
    jour
  • DOI
    10.1109/TCYB.2014.2298235
  • Filename
    6756997
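
As a rough illustration of the dense baseline the abstract describes (hidden nodes generated randomly, output weights obtained through a matrix inversion), here is a minimal NumPy sketch. The function names `train_elm` and `predict_elm`, the sigmoid activation, the Gaussian initialization, and the ridge-style solution beta = (I/C + H^T H)^(-1) H^T t are assumptions chosen for illustration and are not taken from this record. The sketch does not implement the paper's sparse ELM or its sub-problem training algorithm.

```python
import numpy as np


def _hidden(X, W, b):
    # Sigmoid feature mapping (an assumed activation, not specified in the record).
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))


def train_elm(X, t, n_hidden=50, C=1.0, seed=0):
    """Fit a dense ELM-style binary classifier; t holds labels in {-1, +1}."""
    rng = np.random.default_rng(seed)
    # Hidden-layer parameters are drawn at random, independently of the data.
    W = rng.standard_normal((X.shape[1], n_hidden))
    b = rng.standard_normal(n_hidden)
    H = _hidden(X, W, b)
    # Regularized least squares for the output weights (assumed form):
    # solve (I/C + H^T H) beta = H^T t rather than forming the inverse explicitly.
    A = np.eye(n_hidden) / C + H.T @ H
    beta = np.linalg.solve(A, H.T @ t)
    return W, b, beta


def predict_elm(X, W, b, beta):
    # The sign of the network output gives the predicted class.
    return np.sign(_hidden(X, W, b) @ beta)


# Toy usage on two synthetic, well-separated Gaussian blobs (hypothetical data).
demo_rng = np.random.default_rng(1)
X_demo = np.vstack([
    demo_rng.normal(-2.0, 0.5, size=(100, 2)),
    demo_rng.normal(2.0, 0.5, size=(100, 2)),
])
t_demo = np.concatenate([-np.ones(100), np.ones(100)])
W_demo, b_demo, beta_demo = train_elm(X_demo, t_demo)
train_accuracy = np.mean(predict_elm(X_demo, W_demo, b_demo, beta_demo) == t_demo)
```

Every entry of `beta` is generally nonzero here, which is the dense-solution property the abstract contrasts with the sparse ELM.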