Title :
A Novel Decision-Tree Method for Structured Continuous-Label Classification
Author :
Hsiao-Wei Hu ; Yen-Liang Chen ; Kwei Tang
Author_Institution :
Dept. of Inf. Manage., Fu-Jen Catholic Univ., Taipei, Taiwan
Abstract :
Structured continuous-label classification is a variety of classification in which the label is continuous in the data, but the goal is to classify data into classes that are a set of predefined ranges and can be organized in a hierarchy. In the hierarchy, the ranges at the lower levels are more specific and inherently more difficult to predict, whereas the ranges at the upper levels are less specific and inherently easier to predict. Therefore, both prediction specificity and prediction accuracy must be considered when building a decision tree (DT) from this kind of data. This paper proposes a novel classification algorithm for learning DT classifiers from data with structured continuous labels. This approach considers the distribution of labels throughout the hierarchical structure during the construction of trees without requiring discretization in the preprocessing stage. We compared the results of the proposed method with those of the C4.5 algorithm using eight real data sets. The empirical results indicate that the proposed method outperforms the C4.5 algorithm with regard to prediction accuracy, prediction specificity, and computational complexity.
Keywords :
computational complexity; decision trees; pattern classification; C4.5 algorithm; DT classifier learning; computational complexity; data classification; decision-tree method; hierarchical structure; label distribution; prediction accuracy; prediction specificity; structured continuous-label classification; Accuracy; Argon; Heuristic algorithms; Partitioning algorithms; Prediction algorithms; Regression tree analysis; Testing; Classification algorithms; data mining; decision trees (DTs);
Journal_Title :
Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMCB.2012.2229269