DocumentCode :
984368
Title :
Robust rule-based prediction
Author :
Li, Jiuyong
Author_Institution :
Dept. of Mathematics & Comput., Univ. of Southern Queensland, Toowoomba, Qld.
Volume :
18
Issue :
8
fYear :
2006
Firstpage :
1043
Lastpage :
1054
Abstract :
This paper studies a problem of robust rule-based classification, i.e., making predictions in the presence of missing values in data. This study differs from other missing value handling research in that it does not handle missing values but builds a rule-based classification model to tolerate missing values. Based on a commonly used rule-based classification model, we characterize the robustness of a hierarchy of rule sets as k-optimal rule sets with the decreasing size corresponding to the decreasing robustness. We build classifiers based on k-optimal rule sets and show experimentally that they are more robust than some benchmark rule-based classifiers, such as C4.5rules and CBA. We also show that the proposed approach is better than two well-known missing value handling methods for missing values in test data
Keywords :
data handling; data mining; pattern classification; k-optimal rule sets; missing value handling method; rule-based classification; Benchmark testing; Costs; Data mining; Decision trees; Humidity; Performance evaluation; Rain; Robustness; System testing; Data mining; classification; robustness.; rule;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2006.129
Filename :
1644728
Link To Document :
بازگشت