DocumentCode :
408369
Title :
Building decision trees using functional dependencies
Author :
Lam, Kwok-wa ; Lee, Victor C S
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon, China
Volume :
2
fYear :
2004
fDate :
5-7 April 2004
Firstpage :
470
Abstract :
Decision tree (DT) induction is regarded as a representative of traditional approaches to classification rule mining which is an important technique for many data mining applications. Using a heuristic-based local search, DT induction appends attribute at a time to rules in the order of goodness. This method may eliminate some typical structures that several attributes collectively determine the class. Recently, there has been growing interest in the problem of discovering functional dependencies (FDs) from existing databases [[Flach et al.], [Y. Huhtala et al., (1999)], [Lopes et al.], [Novelli et al.]]. Some efficient and scalable algorithms have been proposed. In this paper, we present a new method to build a DT classifier using approximate FDs [Y. Huhtala et al., (1999)]. The new method is different from the traditional ways of building DTs in that it searches composite attributes for individual node of a DT which leads to substantially smaller and more understandable DTs without adversely affecting the accuracy gains. Experiments showed that the new method not only builds more accurate classifiers, but also does this with more compact structures.
Keywords :
data mining; decision trees; pattern classification; relational databases; search problems; classification rule mining; data mining; decision tree; functional dependency; heuristic-based local search; relational database; Application software; Buildings; Classification tree analysis; Computer science; Data mining; Data warehouses; Decision trees; Information technology; Relational databases; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN :
0-7695-2108-8
Type :
conf
DOI :
10.1109/ITCC.2004.1286698
Filename :
1286698
Link To Document :
بازگشت