• DocumentCode
    1065961
  • Title

    Specifying mining algorithms with iterative user-defined aggregates

  • Author

    Giannotti, Fosca ; Manco, Giuseppe ; Turini, Franco

  • Author_Institution
    CNR, Pisa, Italy
  • Volume
    16
  • Issue
    10
  • fYear
    2004
  • Firstpage
    1232
  • Lastpage
    1246
  • Abstract
    We present a way of exploiting domain knowledge in the design and implementation of data mining algorithms, with special attention to frequent patterns discovery, within a deductive framework. In our framework, domain knowledge is represented by way of deductive rules, and data mining algorithms are specified by means of iterative user-defined aggregates and implemented by means of user-defined predicates. This choice allows us to exploit the full expressive power of deductive rules without loosing in performance. Iterative user-defined aggregates have a fixed scheme, in which user-defined predicates are to be added. This feature allows the modularization of data mining algorithms, thus providing a way to integrate the proper domain knowledge exploitation in the right point. As a case study, we present how user-defined aggregates can be exploited to specify and implement a version of the a priori algorithm. Some performance analyzes and comparisons are discussed in order to show the effectiveness of the approach.
  • Keywords
    data mining; deductive databases; knowledge based systems; logic programming languages; pattern recognition; query languages; query processing; tree data structures; association rules; constraint language; data mining algorithm modularization; deductive rules; domain knowledge exploitation; iterative user-defined aggregates; logic language; pattern discovery; query language; rule-based databases; user-defined predicates; Aggregates; Algorithm design and analysis; Association rules; Data mining; Database languages; Deductive databases; Helium; Iterative algorithms; Logic; Performance analysis; 65; Index Terms- Data mining; association rules.; constraint and logic languages; query languages; rule-based databases; user-defined aggregates;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2004.64
  • Filename
    1324631