• DocumentCode
    988025
  • Title

    Database mining: a performance perspective

  • Author

    Agrawal, Rakesh ; Imielinski, Tomasz ; Swami, Arun

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • Volume
    5
  • Issue
    6
  • fYear
    1993
  • fDate
    12/1/1993 12:00:00 AM
  • Firstpage
    914
  • Lastpage
    925
  • Abstract
    The authors´ perspective of database mining as the confluence of machine learning techniques and the performance emphasis of database technology is presented. Three classes of database mining problems involving classification, associations, and sequences are described. It is argued that these problems can be uniformly viewed as requiring discovery of rules embedded in massive amounts of data. A model and some basic operations for the process of rule discovery are described. It is shown how the database mining problems considered map to this model, and how they can be solved by using the basic operations proposed. An example is given of an algorithm for classification obtained by combining the basic rule discovery operations. This algorithm is efficient in discovering classification rules and has accuracy comparable to ID3, one of the best current classifiers
  • Keywords
    database management systems; decision theory; knowledge based systems; learning (artificial intelligence); performance evaluation; DBMS mining; ID3; associations; classification; database mining; decision trees; knowledge discovery; machine learning techniques; performance perspective; rule discovery; sequences; Classification algorithms; Classification tree analysis; Data processing; Databases; Decision trees; Gold; History; Humans; Machine learning; Marketing and sales;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.250074
  • Filename
    250074