• DocumentCode
    3164900
  • Title

    Dominance Programming for Itemset Mining

  • Author

    Negrevergne, Benjamin ; Dries, Anton ; Guns, Tias ; Nijssen, Siegfried

  • Author_Institution
    Dept. of Comput. Sci., KU Leuven, Leuven, Belgium
  • fYear
    2013
  • fDate
    7-10 Dec. 2013
  • Firstpage
    557
  • Lastpage
    566
  • Abstract
    Finding small sets of interesting patterns is an important challenge in pattern mining. In this paper, we argue that several well-known approaches that address this challenge are based on performing pair wise comparisons between patterns. Examples include finding closed patterns, free patterns, relevant subgroups and skyline patterns. Although progress has been made on each of these individual problems, a generic approach for solving these problems (and more) is still lacking. This paper tackles this challenge. It proposes a novel, generic approach for handling pattern mining problems that involve pair wise comparisons between patterns. Our key contributions are the following. First, we propose a novel algebra for programming pattern mining problems. This algebra extends relational algebras in a novel way towards pattern mining. It allows for the generic combination of constraints on individual patterns with dominance relations between patterns. Second, we introduce a modified generic constraint satisfaction system to evaluate these algebraic expressions. Experiments show that this generic approach can indeed effectively identify patterns expressed in the algebra.
  • Keywords
    data mining; relational algebra; dominance programming; dominance relations; generic approach; generic constraint satisfaction system; itemset mining; pattern mining problems; pattern pair wise comparison; relational algebra; Algebra; Data mining; Frequency measurement; Generators; Itemsets; Programming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2013 IEEE 13th International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2013.92
  • Filename
    6729540