Title :
Solving the fragmentation problem of decision trees by discovering boundary emerging patterns
Author :
Li, Jinyan ; Wong, Limsoon
Author_Institution :
Labs. for Inf. Technol., Singapore, Singapore
Abstract :
The single coverage constraint discourages a decision tree to contain many significant rules. The loss of significant rules leads to a loss in accuracy. On the other hand, the fragmentation problem causes a decision tree to contain too many minor rules. The presence of minor rules decreases the accuracy. We propose to use emerging patterns to solve these problems. In our approach, many globally significant rules can be discovered. Extensive expert. mental results on gene expression datasets show that our approach are more accurate than single C4.5 trees, and are also better than bagged or boosted C4.5 trees.
Keywords :
data mining; decision trees; decision trees; emerging pattern; fragmentation problem; gene expression datasets; minor rules; rule discovery; Classification tree analysis; Decision trees; Gene expression; Information technology; Laboratories; Neural networks; Support vector machine classification; Support vector machines; Testing; Training data;
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
DOI :
10.1109/ICDM.2002.1184021