• DocumentCode
    24360
  • Title

    A Novel Decision-Tree Method for Structured Continuous-Label Classification

  • Author

    Hsiao-Wei Hu ; Yen-Liang Chen ; Kwei Tang

  • Author_Institution
    Dept. of Inf. Manage., Fu-Jen Catholic Univ., Taipei, Taiwan
  • Volume
    43
  • Issue
    6
  • fYear
    2013
  • fDate
    Dec. 2013
  • Firstpage
    1734
  • Lastpage
    1746
  • Abstract
    Structured continuous-label classification is a variety of classification in which the label is continuous in the data, but the goal is to classify data into classes that are a set of predefined ranges and can be organized in a hierarchy. In the hierarchy, the ranges at the lower levels are more specific and inherently more difficult to predict, whereas the ranges at the upper levels are less specific and inherently easier to predict. Therefore, both prediction specificity and prediction accuracy must be considered when building a decision tree (DT) from this kind of data. This paper proposes a novel classification algorithm for learning DT classifiers from data with structured continuous labels. This approach considers the distribution of labels throughout the hierarchical structure during the construction of trees without requiring discretization in the preprocessing stage. We compared the results of the proposed method with those of the C4.5 algorithm using eight real data sets. The empirical results indicate that the proposed method outperforms the C4.5 algorithm with regard to prediction accuracy, prediction specificity, and computational complexity.
  • Keywords
    computational complexity; decision trees; pattern classification; C4.5 algorithm; DT classifier learning; computational complexity; data classification; decision-tree method; hierarchical structure; label distribution; prediction accuracy; prediction specificity; structured continuous-label classification; Accuracy; Argon; Heuristic algorithms; Partitioning algorithms; Prediction algorithms; Regression tree analysis; Testing; Classification algorithms; data mining; decision trees (DTs);
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TSMCB.2012.2229269
  • Filename
    6418009