Sparse Extreme Learning Machine for Classification

Author

Zuo Bai ; Guang-Bin Huang ; Danwei Wang ; Han Wang ; Westover, M. Brandon

Author_Institution

Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore

Volume

44

Issue

10

fYear

2014

fDate

Oct. 2014

Firstpage

1858

Lastpage

1870

Abstract

Extreme learning machine (ELM) was initially proposed for single-hidden-layer feedforward neural networks (SLFNs). In the hidden layer (feature mapping), nodes are randomly generated independently of training data. Furthermore, a unified ELM was proposed, providing a single framework to simplify and unify different learning methods, such as SLFNs, least square support vector machines, proximal support vector machines, and so on. However, the solution of unified ELM is dense, and thus, usually plenty of storage space and testing time are required for large-scale applications. In this paper, a sparse ELM is proposed as an alternative solution for classification, reducing storage space and testing time. In addition, unified ELM obtains the solution by matrix inversion, whose computational complexity is between quadratic and cubic with respect to the training size. It still requires plenty of training time for large-scale problems, even though it is much faster than many other traditional methods. In this paper, an efficient training algorithm is specifically developed for sparse ELM. The quadratic programming problem involved in sparse ELM is divided into a series of smallest possible sub-problems, each of which are solved analytically. Compared with SVM, sparse ELM obtains better generalization performance with much faster training speed. Compared with unified ELM, sparse ELM achieves similar generalization performance for binary classification applications, and when dealing with large-scale binary classification problems, sparse ELM realizes even faster training speed than unified ELM.

Keywords

generalisation (artificial intelligence); learning (artificial intelligence); matrix algebra; pattern classification; quadratic programming; ELM; SLFN; binary classification applications; cubic computational complexity; data classification solution; generalization performance; learning methods; least square support vector machines; matrix inversion; proximal support vector machines; quadratic computational complexity; quadratic programming problem; single-hidden-layer feedforward neural networks; sparse extreme learning machine; Kernel; Optimization; Sparse matrices; Support vector machines; Testing; Training; Training data; Classification; extreme learning machine (ELM); quadratic programming (QP); sequential minimal optimization (SMO); sparse ELM; support vector machine (SVM); unified ELM;

fLanguage

English

Journal_Title

Cybernetics, IEEE Transactions on

Publisher

ieee

ISSN

2168-2267

Type

jour

DOI

10.1109/TCYB.2014.2298235

Filename

6756997