DocumentCode
2129396
Title
A Case Study on Classification Reliability
Author
Dai, Honghua
fYear
2008
fDate
15-19 Dec. 2008
Firstpage
69
Lastpage
73
Abstract
The reliability of an induced classifier can be affected by several factors including the data oriented factors and the algorithm oriented factors. In some cases, the reliability could also be affected by knowledge oriented factors. In this paper, we analyze three special cases to examine the reliability of the discovered knowledge. Our case study results show that (1) in the cases of mining from low quality data, rough classification approach is more reliable than exact approach which in general tolerate to low quality data; (2) Without sufficient large size of the data, the reliability of the discovered knowledge will be decreased accordingly; (3) The reliability of point learning approach could easily be misled by noisy data. It will in most cases generate an unreliable interval and thus affect the reliability of the discovered knowledge. It is also reveals that the inexact field is a good learning strategy that could model the potentials and to improve the discovery reliability.
Keywords
data mining; pattern classification; software reliability; classification reliability; data mining; data oriented factors; discovery reliability; knowledge discovery; knowledge oriented factors; learning strategy; rough classification approach; Bayesian methods; Conferences; Data engineering; Data mining; Decision trees; Induction generators; Information technology; Reliability engineering; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops, 2008. ICDMW '08. IEEE International Conference on
Conference_Location
Pisa
Print_ISBN
978-0-7695-3503-6
Electronic_ISBN
978-0-7695-3503-6
Type
conf
DOI
10.1109/ICDMW.2008.97
Filename
4733923
Link To Document