• DocumentCode
    1777078
  • Title

    An approach for classifying large dataset using ensemble classifiers

  • Author

    Abad, Sajad Khodarahmi Jahan ; Zare-Mirakabad, Mohammad-Reza ; Rezaeian, Mehdi

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Yazd Univ., Yazd, Iran
  • fYear
    2014
  • fDate
    29-30 Oct. 2014
  • Firstpage
    785
  • Lastpage
    789
  • Abstract
    Efficiency of general classification models in various problems is different according to the characteristics and the space of the problem. Even in a particular issue, it may not be distinguished a special privilege for a classifier method than the others. Ensemble classifier methods aim to combine the results of several classifiers to cover the deficiency of each classifier by others. This combination faces high computational complexity if it includes a lazy base classifier, especially when handling large datasets. In this paper a method is proposed to combine the results of classifiers, which uses clustering as a part of the training, resulting in reducing the computational complexity, while it provides an acceptable accuracy. In this method the base classifiers are trained by a part of the input dataset, first. Then, according to the labels defined by the base classifiers, the clusters are created for another part of dataset. Finally, the samples contained in the clusters, the cluster that each sample belongs to it, and the distance of each sample to the center of all clusters are given to an artificial neural network and the final class label of test data is determined by the neural network. Experiments on several datasets show advantages of proposed model.
  • Keywords
    computational complexity; data handling; neural nets; pattern classification; pattern clustering; artificial neural network; clustering; computational complexity; ensemble classifier methods; large dataset classification; large dataset handling; Accuracy; Classification algorithms; Clustering algorithms; Neural networks; Prediction algorithms; Training; Training data; base classifiers; classification; clustering; ensemble classifiers; large dataset;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Knowledge Engineering (ICCKE), 2014 4th International eConference on
  • Conference_Location
    Mashhad
  • Print_ISBN
    978-1-4799-5486-5
  • Type

    conf

  • DOI
    10.1109/ICCKE.2014.6993440
  • Filename
    6993440