• DocumentCode
    1070818
  • Title

    A Dynamic Discretization Approach for Constructing Decision Trees with a Continuous Label

  • Author

    Hu, Hsiao-Wei ; Chen, Yen-Liang ; Tang, Kwei

  • Author_Institution
    Dept. of Inf. Manage., Nat. Central Univ., Chungli, Taiwan
  • Volume
    21
  • Issue
    11
  • fYear
    2009
  • Firstpage
    1505
  • Lastpage
    1514
  • Abstract
    In traditional decision (classification) tree algorithms, the label is assumed to be a categorical (class) variable. When the label is a continuous variable in the data, two possible approaches based on existing decision tree algorithms can be used to handle the situations. The first uses a data discretization method in the preprocessing stage to convert the continuous label into a class label defined by a finite set of nonoverlapping intervals and then applies a decision tree algorithm. The second simply applies a regression tree algorithm, using the continuous label directly. These approaches have their own drawbacks. We propose an algorithm that dynamically discretizes the continuous label at each node during the tree induction process. Extensive experiments show that the proposed method outperforms the preprocessing approach, the regression tree approach, and several nontree-based algorithms.
  • Keywords
    data mining; decision trees; continuous label; data mining; decision trees; dynamic discretization; regression tree algorithm; Classification algorithms; Classification tree analysis; Clustering algorithms; Data mining; Decision trees; Heuristic algorithms; Insurance; Partitioning algorithms; Regression tree analysis; Training data; Decision trees; classification.; data mining;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2009.24
  • Filename
    4752827