Title :
Simple estimators for relational Bayesian classifiers
Author :
Neville, Jennifer ; Jensen, David ; Gallagher, Brian
Author_Institution :
Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
Abstract :
We present the relational Bayesian classifier (RBC), a modification of the simple Bayesian classifier (SBC) for relational data. There exist several Bayesian classifiers that learn predictive models of relational data, but each uses a different estimation technique for modelling heterogeneous sets of attribute values. The effects of data characteristics on estimation have not been explored. We consider four simple estimation techniques and evaluate them on three real-world data sets. The estimator that assumes each multiset value is independently drawn from the same distribution (INDEPVAL) achieves the best empirical results. We examine bias and variance tradeoffs over a range of data sets and show that INDEPVAL's ability to model more multiset information results in lower bias estimates and contributes to its superior performance.
Keywords :
belief networks; estimation theory; learning (artificial intelligence); relational databases; INDEPVAL; estimation technique; multiset information; relational Bayesian classifiers; relational data sets; Bayesian methods; Computer science; Data mining; Drives; Laboratories; Motion pictures; Predictive models
Conference_Title :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1250989