Title :
Identifying Markov blankets with decision tree induction
Author :
Frey, Lewis ; Fisher, Douglas ; Tsamardinos, Ioannis ; Aliferis, Constantin F. ; Statnikov, Alexander
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Vanderbilt Univ., Nashville, TN, USA
Abstract :
The Markov blanket of a target variable is the minimum conditioning set of variables that makes the target independent of all other variables. Markov blankets inform feature selection, aid in causal discovery and serve as a basis for scalable methods of constructing Bayesian networks. We apply decision tree induction to the task of Markov blanket identification. Notably, we compare (a) C5.0, a widely used algorithm for decision rule induction, (b) C5C, which post-processes C5.0 ´s rule set to retain the most frequently referenced variables and (c) PC, a standard method for Bayesian network induction. C5C performs as well as or better than C5.0 and PC across a number of data sets. Our modest variation of an inexpensive, accurate, off-the-shelf induction engine mitigates the need for specialized procedures, and establishes baseline performance against which specialized algorithms can be compared.
Keywords :
Markov processes; belief networks; decision trees; learning (artificial intelligence); Bayesian networks; Markov blanket identification; decision tree induction; feature selection; learning (artificial intelligence); Bayesian methods; Biomedical informatics; Computer science; Costs; Decision trees; Engines; Gene expression; Organisms; Performance evaluation; Testing;
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1250903