Title :
Leveraging D-Separation for Relational Data Sets
Author :
Rattigan, Matthew J H ; Jensen, David
Author_Institution :
Dept. of Comput. Sci., Univ. of Massachusetts, Amherst, MA, USA
Abstract :
Testing for marginal and conditional independence is a common task in machine learning and knowledge discovery applications. Prior work has demonstrated that conventional independence tests suffer from dramatically increased rates of Type I errors when naively applied to relational data. We use graphical models to specify the conditions under which these errors occur, and use those models to devise novel and accurate conditional independence tests.
Keywords :
data mining; learning (artificial intelligence); relational databases; D-separation; conditional independence; graphical models; knowledge discovery; machine learning; relational data sets; testing marginal;
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-9131-5
Electronic_ISBN :
1550-4786
DOI :
10.1109/ICDM.2010.142