DocumentCode
988025
Title
Database mining: a performance perspective
Author
Agrawal, Rakesh ; Imielinski, Tomasz ; Swami, Arun
Author_Institution
IBM Almaden Res. Center, San Jose, CA, USA
Volume
5
Issue
6
fYear
1993
fDate
12/1/1993 12:00:00 AM
Firstpage
914
Lastpage
925
Abstract
The authors´ perspective of database mining as the confluence of machine learning techniques and the performance emphasis of database technology is presented. Three classes of database mining problems involving classification, associations, and sequences are described. It is argued that these problems can be uniformly viewed as requiring discovery of rules embedded in massive amounts of data. A model and some basic operations for the process of rule discovery are described. It is shown how the database mining problems considered map to this model, and how they can be solved by using the basic operations proposed. An example is given of an algorithm for classification obtained by combining the basic rule discovery operations. This algorithm is efficient in discovering classification rules and has accuracy comparable to ID3, one of the best current classifiers
Keywords
database management systems; decision theory; knowledge based systems; learning (artificial intelligence); performance evaluation; DBMS mining; ID3; associations; classification; database mining; decision trees; knowledge discovery; machine learning techniques; performance perspective; rule discovery; sequences; Classification algorithms; Classification tree analysis; Data processing; Databases; Decision trees; Gold; History; Humans; Machine learning; Marketing and sales;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/69.250074
Filename
250074
Link To Document