• DocumentCode
    3172
  • Title

    Effective Classification Using a Small Training Set Based on Discretization and Statistical Analysis

  • Author

    Bruni, Renato ; Bianchi, Gianpiero

  • Author_Institution
    Dept. of Comput., Control & Manage. Eng., Univ. of Rome “Sapienza”, Rome, Italy
  • Volume
    27
  • Issue
    9
  • fYear
    2015
  • fDate
    Sept. 1 2015
  • Firstpage
    2349
  • Lastpage
    2361
  • Abstract
    This work deals with the problem of producing a fast and accurate data classification, learning it from a possibly small set of records that are already classified. The proposed approach is based on the framework of the so-called Logical Analysis of Data (LAD), but enriched with information obtained from statistical considerations on the data. A number of discrete optimization problems are solved in the different steps of the procedure, but their computational demand can be controlled. The accuracy of the proposed approach is compared to that of the standard LAD algorithm, of support vector machines and of label propagation algorithm on publicly available datasets of the UCI repository. Encouraging results are obtained and discussed.
  • Keywords
    classification; data analysis; formal logic; learning (artificial intelligence); statistical analysis; support vector machines; LAD algorithm; UCI repository; data classification; discretization; label propagation algorithm; learning; logical analysis of data; small training set; statistical analysis; support vector machines; Accuracy; Decision trees; Machine learning algorithms; Prediction algorithms; Standards; Support vector machines; Training; Classification Algorithms; Classification algorithms; Data Mining; Discrete Mathematics; Machine Learning; Optimization; data mining; discrete mathematics; machine learning; optimization;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2015.2416727
  • Filename
    7069208