Title :
Research on algorithm of decision tree induction
Author :
Ding, Hua ; Wang, Xiu-Kun
Author_Institution :
Dept. of Comput. Sci., Dalian Univ. of Technol., China
Abstract :
This paper introduces the development trend of decision trees in the past several years. During these years, the ID3 algorithm has been at its highest point in decision trees. ID3 makes use of information entropy as heuristics to select a "excellent attribution" at each node, so it can get a smaller depth but not the proper width of the tree, i.e., if the width is large, no matter how small the decision tree is, the leaf node will not be small. It is very important that smaller nodes get higher classify precision for decision trees. This paper provides two algorithms, which can avoid the deficiency of ID3 and reduce the width of the tree to get a better result - the one is probability based PID, and the other is entropy based EMID.
Keywords :
decision trees; entropy; learning (artificial intelligence); probability; ID3 algorithm; decision tree induction; growing algorithm; information entropy; learning algorithm; probability; pruning algorithm; Classification tree analysis; Computer science; Decision trees; Gain measurement; Industrial control; Information entropy; Machine learning; Machine learning algorithms; Mutual information; Parallel algorithms;
Conference_Titel :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
DOI :
10.1109/ICMLC.2002.1174546