Title :
Text categorization rule extraction based on fuzzy decision tree
Author :
Wang, Yu ; Wang, Zheng-Ou
Author_Institution :
Inst. of Syst. Eng., Tianjin Univ., China
Abstract :
In this paper, a new method for text categorization rule extraction based on fuzzy decision tree is presented. An improved chi-square statistic is adopted. The new method reduces features of text in terms of the improved chi-square statistic, and so largely reduces the dimensions of the vector space. And then, a new method for the construction of membership functions is presented, which reduces the time of data fuzzification largely and increase categorization accuracy consequently. Finally, the fuzzy decision tree is applied to the text categorization. Both the understandable categorization rules and the better accuracy of categorization can be acquired.
Keywords :
data mining; decision trees; fuzzy set theory; text analysis; chi-square statistic; fuzzy decision tree; membership function construction; text categorization rule extraction; Classification tree analysis; Computer science; Data mining; Decision trees; Feature extraction; Fuzzy systems; Mathematics; Statistics; Systems engineering and theory; Text categorization; Feature Reduction; Fuzzy Decision Tree; Membership Functions; Rule Extraction;
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1527296