DocumentCode
3172
Title
Effective Classification Using a Small Training Set Based on Discretization and Statistical Analysis
Author
Bruni, Renato ; Bianchi, Gianpiero
Author_Institution
Dept. of Comput., Control & Manage. Eng., Univ. of Rome “Sapienza”, Rome, Italy
Volume
27
Issue
9
fYear
2015
fDate
Sept. 1 2015
Firstpage
2349
Lastpage
2361
Abstract
This work deals with the problem of producing a fast and accurate data classification, learning it from a possibly small set of records that are already classified. The proposed approach is based on the framework of the so-called Logical Analysis of Data (LAD), but enriched with information obtained from statistical considerations on the data. A number of discrete optimization problems are solved in the different steps of the procedure, but their computational demand can be controlled. The accuracy of the proposed approach is compared to that of the standard LAD algorithm, of support vector machines and of label propagation algorithm on publicly available datasets of the UCI repository. Encouraging results are obtained and discussed.
Keywords
classification; data analysis; formal logic; learning (artificial intelligence); statistical analysis; support vector machines; LAD algorithm; UCI repository; data classification; discretization; label propagation algorithm; learning; logical analysis of data; small training set; statistical analysis; support vector machines; Accuracy; Decision trees; Machine learning algorithms; Prediction algorithms; Standards; Support vector machines; Training; Classification Algorithms; Classification algorithms; Data Mining; Discrete Mathematics; Machine Learning; Optimization; data mining; discrete mathematics; machine learning; optimization;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2015.2416727
Filename
7069208
Link To Document