Title :
A global discretization method based on rough sets
Author :
Shi, Hong ; Fu, Jin-Zong
Author_Institution :
Dept. of Comput. Sci., Tianjin Univ., China
Abstract :
Since rough sets theory can unveil the dependency of data and implement data reduction, it has been increasingly researched in more and more fields. In rough sets theory and other induction learning systems, discretization is an important algorithm and can be viewed as a process of information generalization (or abstraction) and data reduction. In this paper, a global discretization algorithm is proposed based on rough sets. It modifies the criterion of selecting the best cut points, and introduces inconsistency checking to preserve the fidelity of the original data, which change the MDLP method into a global one. Thus the reduction of cut points can be performed while keeping the consistency level. The proposed algorithm is tested by using several data sets with ID3 and ROSETTA. Experiments results show that this method performs better than MDLP, and is also superior to those which process continuous data directly without discretization.
Keywords :
data reduction; generalisation (artificial intelligence); learning (artificial intelligence); rough set theory; cut point; data dependency; data reduction; global discretization; inconsistency checking; induction learning; information generalization; rough sets; Acceleration; Accuracy; Computer science; Cybernetics; Learning systems; Machine learning; Rough sets; Testing; Consistency; Discretization; Reduction; Rough sets;
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1527466