Title :
Learning conditional independence tree for ranking
Author :
Su, Jiang ; Zhang, Harry
Author_Institution :
Fac. of Comput. Sci., New Brunswick Univ., Fredericton, NB, Canada
Abstract :
Accurate ranking is desired in many real-world data mining applications. Traditional learning algorithms, however, aim only at high classification accuracy. It has been observed that both traditional decision trees and naive Bayes produce good classification accuracy but poor probability estimates. In this paper, we use a new model, conditional independence tree (CITree), which is a combination of decision tree and naive Bayes and more suitable for ranking and more learnable in practice. We propose a novel algorithm for learning CITree for ranking, and the experiments show that the CITree algorithm outperforms the state-of-the-art decision tree learning algorithm C4.4 and naive Bayes significantly in yielding accurate rankings. Our work provides an effective data mining algorithm for applications in which an accurate ranking is required.
Keywords :
Bayes methods; data mining; decision trees; learning (artificial intelligence); accurate ranking; conditional independence tree; data mining; decision trees; learning algorithm; naive Bayes method; Application software; Chromium; Classification tree analysis; Computer science; Data mining; Decision trees; Error analysis; Frequency estimation; Niobium; Probability;
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
DOI :
10.1109/ICDM.2004.10021