DocumentCode :
3536146
Title :
Revisiting Interestingness Measures for Knowledge Discovery in Databases
Author :
Vashishtha, Jyoti ; Kumar, Dharminder ; Ratnoo, Saroj
Author_Institution :
Dept. of CSE, GJUST, Hisar, India
fYear :
2012
fDate :
7-8 Jan. 2012
Firstpage :
72
Lastpage :
78
Abstract :
The voluminous amount of data stored in databases contains hidden knowledge which could be valuable to improve decision making process of any organization. As it is not humanely possible to analyze large databases, it has become essential to apply advanced data mining algorithms for extracting patterns (models) from data to support decision making. A number of data mining algorithms produce information of a statistical nature that allows the user to assess how accurate and reliable the discovered knowledge is? However, in many cases this is not enough for the users. Even if the discovered knowledge is highly accurate from a statistical point of view, it might not be interesting to the user. Therefore the process of knowledge discovery in databases (KDD) aims at discovering knowledge that is interesting and useful to the user. Most of the data mining algorithms so far have paid lot of attention to discovery of accurate and comprehensible knowledge. Though, the question of interestingness has been addressed time to time, it is being increasingly realized by data mining community that this subject needs a renewed focus. This paper is an attempt to review the measures of interestingness used in the data mining literature. The main contribution of the paper is to improve the understanding of interestingness measures for discovery of knowledge and identify the unresolved problems to set the directions for the future research in this area.
Keywords :
data mining; database management systems; decision making; decision support systems; pattern classification; statistical analysis; data mining algorithm; data storage; decision making support; hidden knowledge; interestingness measure; knowledge discovery in databases; pattern extraction; Area measurement; Birds; Data mining; Databases; Production; Reliability; Semantics; Data Mining; Interestingness Measures; Rule Discovery;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computing & Communication Technologies (ACCT), 2012 Second International Conference on
Conference_Location :
Rohtak, Haryana
Print_ISBN :
978-1-4673-0471-9
Type :
conf
DOI :
10.1109/ACCT.2012.97
Filename :
6168336
Link To Document :
بازگشت