Title :
Comparison of score metrics for Bayesian network learning
Author :
Yang, Shulin ; Chang, Kuo-Chu
Author_Institution :
Sch. of Inf. Technol. & Eng., George Mason Univ., Fairfax, VA, USA
fDate :
5/1/2002 12:00:00 AM
Abstract :
In order to induct a Bayesian network from data, researchers proposed a variety of score metrics based on different assumptions. The score metric that performs best is of interest. In this paper, we compared the performance of five score metrics: uniform prior score metric (UPSM), conditional uniform prior score metric (CUPSM), Dirichlet prior score metric (DPSM), likelihood-equivalence Bayesian Dirichlet score metric (BDe), and minimum description length (MDL); resulting from five different assumptions: uniform prior, conditional uniform prior, Dirichlet prior, likelihood equivalence, and MDL. We used a three-node net, a five-node net, and the ALARM net to conduct several comparison experiments. The experimental results show that when they are applied to identify the true network structures, the DPSM yields the best discrimination score and BDe may fail to identify the true network if the equivalent sample size is not set properly. When they are applied to learn a network from data using the K2-like greedy search and the maximum likelihood (ML) parameter estimation, the network inducted by the K2D10, corresponding to the tenth-order DPSM, is most similar to the true network based on the cross-entropy criterion. It is concluded that the tenth-order DPSM is the best score metric and the corresponding K2D10 is the most reliable network learning algorithm.
Keywords :
algorithm theory; belief networks; knowledge representation; learning (artificial intelligence); maximum likelihood estimation; ALARM net; Bayesian network learning; Dirichlet prior score metric; K2-like greedy search; conditional uniform prior score metric; likelihood-equivalence Bayesian Dirichlet score metric; maximum likelihood parameter estimation; minimum description length; network learning algorithm; score metrics; uniform prior score metric; Artificial intelligence; Bayesian methods; Computer networks; Databases; Learning; Length measurement; Maximum likelihood estimation; Network topology; Parameter estimation; Probability;
Journal_Title :
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
DOI :
10.1109/TSMCA.2002.803772