• DocumentCode
    2020206
  • Title

    A new method for finding generalized frequent itemsets in generalized association rule mining

  • Author

    Sriphaew, Kritsada ; Theeramunkong, Thanaruk

  • Author_Institution
    Inf. Technol. Program, Thammasat Univ., Pathumthani, Thailand
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    1040
  • Lastpage
    1045
  • Abstract
    Generalized association rule mining is an extension of traditional association rule mining to discover more informative rules, given a taxonomy. We describe a formal framework for the problem of mining generalized association rules. In the framework, The subset-superset and the parent-child relationships among generalized itemsets are introduced to present the different views of generalized itemsets, i.e. the lattice of generalized itemsets and the taxonomies of k-generalized itemsets respectively. We present an optimization technique to reduce the time consumed by applying two constraints each of which corresponds to each view of generalized itemsets. In the mining process, a new set enumeration algorithm, named SET is proposed. It utilizes these constraints to speed up the mining of all generalized frequent itemsets. By experiments on synthetic data, the results show that SET outperforms the current most efficient algorithm, Prutax, by an order of magnitude or more.
  • Keywords
    data mining; database management systems; database theory; knowledge based systems; optimisation; set theory; Prutax; databases; generalized association rule mining; generalized frequent itemsets; knowledge discovery; optimization; parent-child relationships; set enumeration algorithm; subset-superset relationships; Association rules; Constraint optimization; Dairy products; Data mining; Filters; Information technology; Itemsets; Lattices; Taxonomy; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computers and Communications, 2002. Proceedings. ISCC 2002. Seventh International Symposium on
  • ISSN
    1530-1346
  • Print_ISBN
    0-7695-1671-8
  • Type

    conf

  • DOI
    10.1109/ISCC.2002.1021800
  • Filename
    1021800