• DocumentCode
    424334
  • Title

    Research on the relationship between some important split measure functions for decision tree with purity law

  • Author

    Shao, W.A. ; Zhao, Hong

  • Author_Institution
    Northeastern Univ. Software Center, Northeastern Univ., Shenyang, China
  • Volume
    2
  • fYear
    2004
  • fDate
    26-29 Aug. 2004
  • Firstpage
    1267
  • Abstract
    This paper analysis some split measure functions of classical decision tree algorithm. After the research on the structure of these functions, we found out all of them are separable probability measure function, and their core functions are semi-purity functions. They achieve their minimums at row-column independent point, maximums at full-distinguish point, and accordance with purity law. Because chi-square does not support the symmetry, the purity law proposed has wider adaptability than the impurity theory. These can help us to analysis the theory of measure function and the relationship between measure functions and data, and it is important to find some more simple and effective split-measure functions in some special area.
  • Keywords
    data mining; decision trees; learning (artificial intelligence); probability; tree data structures; chi-square; data mining; decision tree algorithm; machine-learning; probability measure function; purity law; row-column independent point; split measure function; Algorithm design and analysis; Area measurement; Board of Directors; Data analysis; Decision trees; Density measurement; Impurities; Mathematics; Software algorithms; Software measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
  • Print_ISBN
    0-7803-8403-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2004.1382387
  • Filename
    1382387