Title :
Statistical considerations in learning from data
Author :
Kyburg, Henry E., Jr.
Author_Institution :
Rochester Univ., NY, USA
Abstract :
In this paper, we focus on statistics. Classical statistics and Bayesian statistics are both employed in data mining. Both have advantages but both also have severe limitations in this context. We point out some of these limitations as well as some of the advantages. The fact that we may need to take account of evidence both internal and external to the data set presents a difficulty for classical statistics. The need to incorporate an objective measure of reliability creates a difficulty for Bayesian statistics. We outline an approach to uncertainty that promises to capture the best of both worlds by incorporating both background knowledge and objectivity
Keywords :
Bayes methods; data mining; learning (artificial intelligence); reliability theory; statistics; uncertainty handling; Bayesian statistics; background knowledge; classical statistics; data mining; learning from data; objective reliability measure; objectivity; uncertainty; Bayesian methods; Cognition; Data mining; Databases; Humans; Probability distribution; Sampling methods; Sociotechnical systems; Statistics; Uncertainty;
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989535