Title :
CTC - correlating tree patterns for classification
Author :
Zimmermann, Albrecht ; Bringmann, Bjorn
Author_Institution :
Machine Learning Lab, Albert-Ludwigs-Univ. Freiburg, Germany
Abstract :
We present CTC, a new approach to structural classification. It uses the predictive power of tree patterns correlating with the class values, combining state-of-the-art tree mining with sophisticated pruning techniques to find the k most discriminative pattern in a dataset. In contrast to existing methods, CTC uses no heuristics and the only parameters to be chosen by the user are the maximum size of the rule set and a single, statistically well founded cut-off value. The experiments show that CTC classifiers achieve good accuracies while the induced models are smaller than those of existing approaches, facilitating comprehensibility.
Keywords :
data mining; pattern classification; trees (mathematics); k most discriminative pattern; pruning technique; structural classification; tree mining; tree pattern correlation; Association rules; Classification tree analysis; Data mining; Drugs; Electronic mail; Frequency measurement; Machine learning; Support vector machines; Tree graphs; XML;
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
Print_ISBN :
0-7695-2278-5
DOI :
10.1109/ICDM.2005.49