DocumentCode
1777078
Title
An approach for classifying large dataset using ensemble classifiers
Author
Abad, Sajad Khodarahmi Jahan ; Zare-Mirakabad, Mohammad-Reza ; Rezaeian, Mehdi
Author_Institution
Sch. of Electr. & Comput. Eng., Yazd Univ., Yazd, Iran
fYear
2014
fDate
29-30 Oct. 2014
Firstpage
785
Lastpage
789
Abstract
Efficiency of general classification models in various problems is different according to the characteristics and the space of the problem. Even in a particular issue, it may not be distinguished a special privilege for a classifier method than the others. Ensemble classifier methods aim to combine the results of several classifiers to cover the deficiency of each classifier by others. This combination faces high computational complexity if it includes a lazy base classifier, especially when handling large datasets. In this paper a method is proposed to combine the results of classifiers, which uses clustering as a part of the training, resulting in reducing the computational complexity, while it provides an acceptable accuracy. In this method the base classifiers are trained by a part of the input dataset, first. Then, according to the labels defined by the base classifiers, the clusters are created for another part of dataset. Finally, the samples contained in the clusters, the cluster that each sample belongs to it, and the distance of each sample to the center of all clusters are given to an artificial neural network and the final class label of test data is determined by the neural network. Experiments on several datasets show advantages of proposed model.
Keywords
computational complexity; data handling; neural nets; pattern classification; pattern clustering; artificial neural network; clustering; computational complexity; ensemble classifier methods; large dataset classification; large dataset handling; Accuracy; Classification algorithms; Clustering algorithms; Neural networks; Prediction algorithms; Training; Training data; base classifiers; classification; clustering; ensemble classifiers; large dataset;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Knowledge Engineering (ICCKE), 2014 4th International eConference on
Conference_Location
Mashhad
Print_ISBN
978-1-4799-5486-5
Type
conf
DOI
10.1109/ICCKE.2014.6993440
Filename
6993440
Link To Document