DocumentCode
20757
Title
Sparse Extreme Learning Machine for Classification
Author
Zuo Bai ; Guang-Bin Huang ; Danwei Wang ; Han Wang ; Westover, M. Brandon
Author_Institution
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume
44
Issue
10
fYear
2014
fDate
Oct. 2014
Firstpage
1858
Lastpage
1870
Abstract
Extreme learning machine (ELM) was initially proposed for single-hidden-layer feedforward neural networks (SLFNs). In the hidden layer (feature mapping), nodes are randomly generated independently of training data. Furthermore, a unified ELM was proposed, providing a single framework to simplify and unify different learning methods, such as SLFNs, least square support vector machines, proximal support vector machines, and so on. However, the solution of unified ELM is dense, and thus, usually plenty of storage space and testing time are required for large-scale applications. In this paper, a sparse ELM is proposed as an alternative solution for classification, reducing storage space and testing time. In addition, unified ELM obtains the solution by matrix inversion, whose computational complexity is between quadratic and cubic with respect to the training size. It still requires plenty of training time for large-scale problems, even though it is much faster than many other traditional methods. In this paper, an efficient training algorithm is specifically developed for sparse ELM. The quadratic programming problem involved in sparse ELM is divided into a series of smallest possible sub-problems, each of which are solved analytically. Compared with SVM, sparse ELM obtains better generalization performance with much faster training speed. Compared with unified ELM, sparse ELM achieves similar generalization performance for binary classification applications, and when dealing with large-scale binary classification problems, sparse ELM realizes even faster training speed than unified ELM.
Keywords
generalisation (artificial intelligence); learning (artificial intelligence); matrix algebra; pattern classification; quadratic programming; ELM; SLFN; binary classification applications; cubic computational complexity; data classification solution; generalization performance; learning methods; least square support vector machines; matrix inversion; proximal support vector machines; quadratic computational complexity; quadratic programming problem; single-hidden-layer feedforward neural networks; sparse extreme learning machine; Kernel; Optimization; Sparse matrices; Support vector machines; Testing; Training; Training data; Classification; extreme learning machine (ELM); quadratic programming (QP); sequential minimal optimization (SMO); sparse ELM; support vector machine (SVM); unified ELM;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TCYB.2014.2298235
Filename
6756997
Link To Document