DocumentCode :
3172
Title :
Effective Classification Using a Small Training Set Based on Discretization and Statistical Analysis
Author :
Bruni, Renato ; Bianchi, Gianpiero
Author_Institution :
Dept. of Comput., Control & Manage. Eng., Univ. of Rome “Sapienza”, Rome, Italy
Volume :
27
Issue :
9
fYear :
2015
fDate :
Sept. 1 2015
Firstpage :
2349
Lastpage :
2361
Abstract :
This work deals with the problem of producing a fast and accurate data classification, learning it from a possibly small set of records that are already classified. The proposed approach is based on the framework of the so-called Logical Analysis of Data (LAD), but enriched with information obtained from statistical considerations on the data. A number of discrete optimization problems are solved in the different steps of the procedure, but their computational demand can be controlled. The accuracy of the proposed approach is compared to that of the standard LAD algorithm, of support vector machines and of label propagation algorithm on publicly available datasets of the UCI repository. Encouraging results are obtained and discussed.
Keywords :
classification; data analysis; formal logic; learning (artificial intelligence); statistical analysis; support vector machines; LAD algorithm; UCI repository; data classification; discretization; label propagation algorithm; learning; logical analysis of data; small training set; statistical analysis; support vector machines; Accuracy; Decision trees; Machine learning algorithms; Prediction algorithms; Standards; Support vector machines; Training; Classification Algorithms; Classification algorithms; Data Mining; Discrete Mathematics; Machine Learning; Optimization; data mining; discrete mathematics; machine learning; optimization;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2015.2416727
Filename :
7069208
Link To Document :
بازگشت