DocumentCode
24360
Title
A Novel Decision-Tree Method for Structured Continuous-Label Classification
Author
Hsiao-Wei Hu ; Yen-Liang Chen ; Kwei Tang
Author_Institution
Dept. of Inf. Manage., Fu-Jen Catholic Univ., Taipei, Taiwan
Volume
43
Issue
6
fYear
2013
fDate
Dec. 2013
Firstpage
1734
Lastpage
1746
Abstract
Structured continuous-label classification is a variety of classification in which the label is continuous in the data, but the goal is to classify data into classes that are a set of predefined ranges and can be organized in a hierarchy. In the hierarchy, the ranges at the lower levels are more specific and inherently more difficult to predict, whereas the ranges at the upper levels are less specific and inherently easier to predict. Therefore, both prediction specificity and prediction accuracy must be considered when building a decision tree (DT) from this kind of data. This paper proposes a novel classification algorithm for learning DT classifiers from data with structured continuous labels. This approach considers the distribution of labels throughout the hierarchical structure during the construction of trees without requiring discretization in the preprocessing stage. We compared the results of the proposed method with those of the C4.5 algorithm using eight real data sets. The empirical results indicate that the proposed method outperforms the C4.5 algorithm with regard to prediction accuracy, prediction specificity, and computational complexity.
Keywords
computational complexity; decision trees; pattern classification; C4.5 algorithm; DT classifier learning; computational complexity; data classification; decision-tree method; hierarchical structure; label distribution; prediction accuracy; prediction specificity; structured continuous-label classification; Accuracy; Argon; Heuristic algorithms; Partitioning algorithms; Prediction algorithms; Regression tree analysis; Testing; Classification algorithms; data mining; decision trees (DTs);
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TSMCB.2012.2229269
Filename
6418009
Link To Document