• DocumentCode
    389735
  • Title

    An effective projection-reduction algorithm for mining long frequents

  • Author

    Wang, Xuo-feng ; Wang, Tian-ran ; Zhao, Yue

  • Volume
    1
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    515
  • Abstract
    An effective projection-reduction algorithm for mining the long patterns frequent is presented. A new idea of top-down mining frequent items is adopted, and some new concepts such as transaction and items association information tables, key-items and reduction items, and projection DB are proposed. The algorithm presented is very effective for mining long frequents, and its validity is proved through analysis of computing complexity. Some examples of computation are given. The computing complexity of the algorithm is related to the average length of items reduction, the complexity approximates to 0.5 × M3N × 0(2s × N´2) in the worst case, where S is the average length of items reduction under min-support given by user, N´ is the number of tuples in the database, N is the number of transactions in the database and M is the average length of item sets in the database. Using heuristic information for pruning useless candidate frequent itemsets improves the efficiency of the algorithm. The algorithm is very effective for mining long frequents, where S is very short and the computing complexity approaches polynomial time.
  • Keywords
    computational complexity; data mining; database theory; algorithm efficiency; associate rules; computing complexity; heuristic information; items association information tables; key-items; long frequents mining; min-support; projection DB; projection-reduction algorithm; reduction items; top-down mining; tuples; Algorithm design and analysis; Automation; Chemical technology; Chromium; Data mining; Electronic mail; Information analysis; Itemsets; Polynomials; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
  • Print_ISBN
    0-7803-7508-4
  • Type

    conf

  • DOI
    10.1109/ICMLC.2002.1176809
  • Filename
    1176809