DocumentCode :
3261587
Title :
A Study on Reliability in Graph Discovery
Author :
Dai, Honghua
Author_Institution :
Sch. of Enginering & Inf. Technol., Deakin Univ., Burwood, Vic.
fYear :
2006
fDate :
Dec. 2006
Firstpage :
775
Lastpage :
779
Abstract :
A critical question in data mining is that can we always trust what discovered by a data mining system unconditionally? The answer is obviously not. If not, when can we trust the discovery then? What are the factors that affect the reliability of the discovery? How do they affect the reliability of the discovery? These are some interesting questions to be investigated. In this paper the authors provide a definition and the measurements of reliability, and analyse the factors that affect the reliability. We then examine the impact of model complexity, weak links, varying sample sizes and the ability of different learners to the reliability of graphical model discovery. The experimental results reveal that (1) the larger sample size for the discovery, the higher reliability we will get; (2) the stronger a graph link is, the easier the discovery will be and thus the higher the reliability it can achieve; (3) the complexity of a graph also plays an important role in the discovery. The higher the complexity of a graph is, the more difficult to induce the graph and the lower reliability it would be
Keywords :
data mining; graph theory; software metrics; software reliability; data mining; graph discovery; model complexity; Australia; Bayesian methods; Conferences; Data mining; Graphical models; Information technology; Machine learning; Stability; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
Type :
conf
DOI :
10.1109/ICDMW.2006.23
Filename :
4063730
Link To Document :
بازگشت