Title :
An entropy-based discretization method for classification rules with inconsistency checking
Author :
Li, Ren-Pu ; Wang, Zheng-Ou
Author_Institution :
Inst. of Syst. Eng., Tianjin Univ., China
Abstract :
Discretization is an effective technique in handling continuous attributes for data mining, especially for classification problems. Most entropy-based discretization methods are local and it is easy to lose valuable information in the data. We present an entropy-based algorithm. Through inconsistency checking, we may add/delete cut points on the basis of a preliminary discretization scheme. So the interaction between all attributes is taken into consideration in the discretization process which makes our method possess a global property. Experimental results indicate that with the same rule generator C4.5, our method can produce stronger rules than existing entropy-based discretization methods.
Keywords :
data mining; entropy; learning (artificial intelligence); pattern classification; set theory; C4.5; classification rules; continuous attributes; cut points; data mining; entropy-based discretization method; global property; inconsistency checking; rule generator; Cybernetics; Data engineering; Data mining; Electronic mail; Entropy; Machine learning; Systems engineering and theory;
Conference_Titel :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
DOI :
10.1109/ICMLC.2002.1176748