DocumentCode
424334
Title
Research on the relationship between some important split measure functions for decision tree with purity law
Author
Shao, W.A. ; Zhao, Hong
Author_Institution
Northeastern Univ. Software Center, Northeastern Univ., Shenyang, China
Volume
2
fYear
2004
fDate
26-29 Aug. 2004
Firstpage
1267
Abstract
This paper analysis some split measure functions of classical decision tree algorithm. After the research on the structure of these functions, we found out all of them are separable probability measure function, and their core functions are semi-purity functions. They achieve their minimums at row-column independent point, maximums at full-distinguish point, and accordance with purity law. Because chi-square does not support the symmetry, the purity law proposed has wider adaptability than the impurity theory. These can help us to analysis the theory of measure function and the relationship between measure functions and data, and it is important to find some more simple and effective split-measure functions in some special area.
Keywords
data mining; decision trees; learning (artificial intelligence); probability; tree data structures; chi-square; data mining; decision tree algorithm; machine-learning; probability measure function; purity law; row-column independent point; split measure function; Algorithm design and analysis; Area measurement; Board of Directors; Data analysis; Decision trees; Density measurement; Impurities; Mathematics; Software algorithms; Software measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN
0-7803-8403-2
Type
conf
DOI
10.1109/ICMLC.2004.1382387
Filename
1382387
Link To Document